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Abstract 

There are compelling reasons to believe that the dark matter of the universe 
is constituted, in large part, by non-baryonic collisionless particles with very small 
primordial velocity dispersion. Such particles are called cold dark matter (CDM). 
The leading candidates are axions and weakly interacting massive particles (WIMPs). 
The word "collisionless" indicates that the particles are so weakly interacting that 
they move purely under the influence of gravity. Galaxies are surrounded by CDM 
and hence, because of gravity, CDM keeps falling onto galaxies from all directions. 
CDM infall produces a discrete number of flows and caustics in the halos of galaxies. 
There are two types of caustics in the halos of galaxies: inner and outer. An outer 
caustic is a simple fold (A2) catastrophe located on a topological sphere enveloping 
the galaxy. An inner caustic is a closed tube whose cross-section is an elliptic umbilic 
(-D-4) catastrophe, with three cusps. 

In space, caustics are the boundaries that separate regions with differing numbers 
of flows. One side of a caustic surface has two more flows than the other. The 
density of CDM particles becomes very large as one approaches the caustic from 
the side with the extra flows. Dark matter caustics have specific density profiles 
and therefore precisely calculable gravitational lensing properties. This dissertation 
presents a formalism that simplifies the relevant calculations, and apply it to four 
specific cases. In the first three cases, the line of sight is tangent to a smooth caustic 
surface. The curvature of the surface at the tangent point is positive, negative, or 
zero. In the fourth case, the line of sight passes near a cusp. For each case we derive 
the map between the image and source planes. In some cases, a point source has 
multiple images and experiences infinite magnification when the images merge. A 



X 

5-S 



1 e-mail : onemli@zephy r . t h. u-psud . fr 



promising approach to reveal the locations of caustics is to observe the distortions in 
the images of extended sources (such as radio jets) passing by the caustics. 
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1 Introduction 



A large amount of astronomical evidence indicates the existence in the universe of more 
gravitationally interacting matter than luminous matter. Most of the matter in the universe 
is dark. The stuff that is responsible for holding galaxies (and clusters of galaxies) together 
is a peculiar kind of matter that we neither see nor detect by any means. This cosmic 
mystery began puzzling astronomers in the 1930s and the case still remains open. For a 
review of the sources and distributions of dark matter, see Sikivie [1] and the references 
therein. 

In 1932, Oort [2] studied the motions of galactic disk stars in the direction perpendic- 
ular to the disk. He applied the virial theorem to the distribution of vertical star velocities, 
and estimated the density of the galactic disk as pdisk — 1-2 • 10~ 23 ^. When the density 
due to the matter seen in stars, interstellar gas, and stellar remnants like white dwarfs is 
calculated, however, one finds an estimate considerably less than the above value. This 
implies the existence of dark matter in the disk. Because this dark matter is in the disk 
rather than the halo, it is expected to be dissipative, hence baryonic in nature. 

In 1933, Zwicky [3] used measurements of the line of sight velocities of galaxies in the 
Coma cluster to estimate the total mass of the cluster using the virial theorem: 



where T and U are the kinetic and potential energy of the system respectively. Bracket ( ) 
denotes time average. In these estimates, however, it is important to remember that the 
virial theorem is valid for a closed system in mechanical equilibrium and that it applies to 
the time average of the system. Whether the observed clusters satisfy these requirements 
is questionable. Nevertheless, assuming the applicability of the theorem, and using for the 
kinetic energy of N galaxies in the cluster: 



{T) = -\{U), 



(1) 



(T) = -N(mv 2 ) , 



(2) 
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the potential energy of the ^N(N — 1) independent galaxy pairs: 



{V) = -\gN(N-1)^, (3) 

and taking N ~ N — 1, one can estimate the total mass: 

M = N(m) ~ . (4) 

By measuring the quantities r and v, it is possible to estimate M. The result obtained was 
about 400 times the mass in the luminous parts of the cluster galaxies. 

During the seventies, the rotation curves (velocity as a function of radial distance 
from the center) of spiral galaxies began to be measured over much larger distances than 
before. Spiral galaxies are large scale structures containing billions of stars, arranged in the 
form of a rotating disk with a central bulge. Assuming that the stars have a circular orbit 
around the galactic center, the rotation velocity of a star can be calculated by balancing 
the centrifugal and the gravitational forces: 

GM(r)m v^ ot m 



(5) 



where M{r) is the mass interior to the orbit radius r, m is the mass of the star and i> ro t( r ) 
is the rotation velocity at r. Here, we used the fact that for cylindrically or spherically 
symmetric distributions, the forces due to the mass lying outside the orbit compensate 
exactly. We have, therefore 

/ GM(r) 

v TO t(r) = y — - — • (6) 

If we assume that the bulge is spherically symmetric with a constant density of p, then 
M(r) = ^7cr 3 p. Hence, for the innermost part of a galaxy, a rotation curve where t> ro t ~ t 
is expected. For the outside of the galaxy, where r > disc-radius, M(r) is equal to the 
total mass of the galaxy M{r) = M galaxy , which is a constant. In this case 



„ rotW = v ^-._L. (7) 
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The data show, however, that v TOt ~ constant for large r. This means that M(r) ~ r, 
which implies the existence of a huge unseen mass extending far beyond the visible region. 
The distribution of globular star clusters suggests a spherical distribution. Theoretical 
model calculations show that pure disk galaxies have a tendency to become bars (i.e., 
within the central nucleus, a bar-like structure forms) [4]. Barred galaxies do exist, but 
they are relatively rare. A spherical halo of dark matter increases the stability of the 
pure disk structure, producing a ratio of barred spirals consistent with observations [5]. 
Considerations of the stellar dynamics in elliptical galaxies also imply that they contain a 
significant fraction of dark matter [6]. 

These observations lead to the hypothesis of a halo of dark matter whose density is 
ddm(r) ~ ^ at large r. The halo distribution is usually modelled by the function 

pUr) = J^fL 2 , (8) 

where r is called the core radius. For our own galaxy, t> rot ~ 220 ^f-, r is a few kpc, and 
p dm (0) ~ 1(T 23 ^. 

According to Wilkinson Microwave Anisotropy Probe (WMAP) results [7, 8, 9], the 
density parameter 

Pc or2 

of the universe has the numerical value f2 U mv = 1.02±0.02. In Eq. 9 p c is the critical density 
for closing the universe and H is the present Hubble expansion rate. Most of the energy 
density of the universe is in the form of vacuum energy f2 A = 0.73 ± 0.04. The density 
due to matter is Q m = 0.27 ± 0.04 and 0.17 ± 0.01 of Q m is baryonic: Q b = 0.044 ± 0.004. 
Since the density of luminous matter is Q\ um . < 0.006, we conclude that some baryons are 
dark. Recall that there is dark matter associated with the disk of a galaxy. Because it is 
in the disk rather than in a halo, this dark matter must be dissipative which presumably 
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means that it is baryonic (disk dark matter must have sufficiently strong interactions to be 
concentrated in a disk by dissipating its energy while conserving its angular momentum). 
Likely hiding places for these dark baryons may be black holes, cold white dwarfs, and 
brown dwarfs (i.e., stars too low in mass to burn by nuclear fusion). Objects of this kind, 
generically called MACHOs (for massive compact halo objects), have been searched for by 
looking for the gravitational lensing of background stars by MACHOs that happen to pass 
close to the line of sight. Many of them have been discovered [10, 11, 12, 13]. We still do 
not know what makes up most of the missing mass in the universe. 

The success of nucleosynthesis in producing the primordial abundances of light ele- 
ments [14], also requires that 0.011 < £l b < 0.12. Studies of large scale structure formation 
support the view that most of the dark matter consists of non-baryonic, collisionless but 
gravitationally interacting particles such as axions and/or Weakly Interacting Massive Par- 
ticles (WIMPs) (like neutralinos, photinos, higgsinos, etc.) [14]. The particles must have 
small primordial velocity dispersion; for this reason, they also are called "Cold Dark Mat- 
ter (CDM)" candidates. We discuss these CDM candidates in the next section. Here, and 
throughout the dissertation, we assume that they exist; and study interesting consequences 
of their motion in the potential well of a galactic halo. We see that these candidates pro- 
duce surfaces (which we call caustics) in physical space where the dark matter density is 
large, because folds exist at the corresponding locations of the sheet on which the particles 
lie in phase space. The CDM caustics may move about, but they are stable. They also have 
specific density profiles, and hence precisely calculable gravitational lensing properties. We 
present a formalism that simplifies the relevant calculations; and apply it to different cases 
to obtain their lensing signatures [15]. 

Let us start with a discussion of the phase space (velocity-position space) structure 
of CDM halos. Since the particles are in three-dimensional (3D) space, the phase space is 
six-dimensional. The primordial velocity dispersion of the cold dark matter candidates is 
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of a very small order [16] 



M(H „>M~r /5 - 7 m 2/ \ (10) 



for axions and 



1/2 / , \ 2/3 



for WIMPs, where t is the present age of the universe, and m a and m x are respectively 
the masses of the axion and the WIMP. Calculations for 5v a (t) and Sv x (t) are given in 
Sections 2.2 and 2.1, respectively. These estimates of primordial velocity dispersions are 
approximate, but in the context of this dissertation and of galaxy formation in general, 5v a 
and 5v x are entirely negligible. Therefore, CDM particles lie on a very thin 3D sheet in 
6D phase space. The thickness of the sheet is the primordial velocity dispersion 5v of the 
particles. Moreover, the present average number density n of the particles can be expressed 
in terms of their energy density in units of the critical density. For axions 

, s 1.5 10 64 ^ / h V /l(T 5 eV\ 



pc 3 \0.7 
see Section 2.2. Likewise, for WIMPs 

, , 1.5 1 50 ^ / /i V /GeV\ 

see Section 2.1. The large exponents indicate that the density of cold dark matter particles 
is enormous in terms of astronomical length scales. Since they are effectively collisionless, 
the particles experience only gravitational forces. These are universal and vary only on huge 
distances compared to the interparticle distance. Hence the sheet on which the particles lie 
in phase space is continuous. It cannot break and its evolution is constrained by topology. 
If each of the aforementioned CDM candidates is present, the phase space sheet has a layer 
for each species, with a thickness proportional to the corresponding velocity dispersion. 

The phase space sheet is located on the 3D hypersurface of points (r, v) : v — 
H(t)r + Av(r, t), where H(t) = ^ is the Hubble expansion rate and Av(f, t) is the peculiar 
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velocity field. Figure 1 shows a 2D section of phase space along the (z, z) plane. The 
wiggly line is the intersection of the 3D sheet on which the particles lie in phase space with 
the plane of the figure. The thickness of the line is the velocity dispersion 8v, whereas 
the amplitude of the wiggles in the line is the peculiar velocity Av. If there were no 
peculiar velocities, the line would be straight, since in that case z = H(t)z. The peculiar 

t z 




Figure 1: The wiggly line is the intersection of the (z,z) plane with the 3D sheet on 
which the collisionless dark matter particles lie in phase space. The thickness of the line 
is the primordial velocity dispersion. The amplitude of the wiggles in the z direction is 
the velocity dispersion associated with density perturbations. Where an overdensity grows 
in the nonlinear regime, the line winds up in clockwise fashion. One such overdensity is 
shown. 

velocities are associated with density perturbations and grow by gravitational instability 
as Av ~ t 2 / 3 . On the other hand the primordial velocity dispersion decreases on average 
as Sv ~ t~ 2 / 3 , consistent with Liouville's theorem. When a large overdensity enters the 
nonlinear regime, the particles in the vicinity of the overdensity fall back onto it. This 
implies that the phase space sheet "winds up" in clockwise fashion wherever an overdensity 
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grows in the nonlinear regime. One such overdensity is shown in Fig. 1. Before density 
perturbations enter the nonlinear regime, there is only one value of velocity (i.e., a single 
flow) at a typical location in physical space, because the phase space sheet covers physical 
space only once. On the other hand, inside an overdensity in the nonlinear regime, the 
sheet wraps up in phase space, turning clockwise in any two dimensional cut (z, z) of 
that space. The physical space coordinate axis z is in an arbitrary direction, and z is its 
associated velocity. The outcome of this process in a galactic halo is an odd number of 
flows at any physical point in the halo [17]. One flow is associated with particles falling 
through the galaxy for the first time (n = 1); another is associated with particles falling 
through the galaxy for the second time (n — 2), and so on (Fig. 2). 

At the boundary between two regions, one of which has n flows and the other n + 2 
flows, the physical space density is very large because the phase space sheet has a fold 
there. At the fold, the phase space sheet is tangent to velocity space and hence, in the 
limit of zero velocity dispersion (5v = 0), the physical space density diverges, since it is 
the integral of the phase space density over velocity space. The structure associated with 
such a phase space fold is called a "caustic" [18, 16, 19]. Generically, caustics are surfaces 
in physical space, since they separate regions with differing number of flows. One side of a 
caustic surface has two more flows than the other. Because caustic surfaces occur wherever 
the number of flows changes, they are topologically stable, in direct analogy with domain 
walls. It is easy to show (Section 3.2) that, in the limit of zero velocity dispersion, the 
density diverges as d ~ ^= when the caustic is approached from the side with n + 2 flows, 
where a is the distance to the caustic. If the velocity dispersion is small, but nonzero, the 
divergence is cut off so that the density at the caustic is no longer infinite, but merely very 
large. 

Zel'dovich [20] emphasized the importance of caustics in large scale structure forma- 
tion, and suggested using the name "pancakes" for them. The reason galaxies tend to lie 
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Figure 2: Snapshot of the phase space distribution of CDM particles in a galactic halo. 
For simplicity, spherical symmetry is assumed: r is galactocentric distance and v is radial 
velocity. The solid line indicates the location of the particles. The dotted line corresponds 
to observer position. Each intersection of the solid and dotted lines corresponds to a CDM 
flow at the observer's location. "Turnaround" refers to the moments in a particle's history 
when it has zero radial velocity with respect to the galactic center. A "caustic" appears 
wherever the phase space line folds back. Particles pile up and hence the density diverges 
there. 

on surfaces [21] such as "the Great Wall" is undoubtedly that the 3D sheet on which the 
dark matter particles and baryons lie in phase space acquires folds on very large scales, 
producing caustics appropriately called Zel'dovich pancakes. Sikivie [16] derived the min- 
imal CDM caustic structure that must occur in galactic halos [16]. There are two types of 
caustics in the halos of galaxies: inner and outer. The outer caustics are simple fold (A 2 ) 
catastrophes located on topological spheres surrounding the galaxy. 

We saw above that where a localized overdensity is growing in the nonlinear regime, 
the line at the intersection of the phase space sheet with the (z, z) plane winds up in a 
clockwise fashion. The onset of this process is shown in Fig. I. Of course, the picture is 
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qualitatively the same in the (x, x) and (y, y) planes. In this view, the process of galactic 
halo formation is the winding up of the phase space sheet of collisionless dark matter 
particles. When the galactic center is approached from any direction, the local number 
of flows increases. First, there is one flow, then three flows, then five, seven. . . . The 
boundary between the region with one (three, five, . . .) and the region with three (five, 
seven, . . .) flows is the location of a caustic, which is topologically a sphere surrounding 
the galaxy (Fig. 2). When these caustic spheres are approached from the inside, the density 
diverges as d ~ in the zero velocity dispersion limit. Outer caustics occur where a given 
outflow reaches its furthest distance from the galactic center before falling back in. 

The inner caustics are rings [18]. They are located near where the particles with the 
most angular momentum in a given inflow reach their distance of closest approach to the 
galactic center before going back out. In the absence of angular momentum, the caustic 
rings collapse to a caustic point at the center whereas the outer caustics are unaffected. A 
caustic ring is more precisely a closed tube with a special structure. Its transverse cross- 
section is a closed line with three cusps, one of which points away from the galactic center 
(Fig. 3). In the language of Catastrophe Theory, such a singularity is called a _D_ 4 (or 
elliptic umbilic) catastrophe [16]. We call it a "tricusp." The existence of these caustics 
and their topological properties are independent of any assumptions of symmetry. Their 
derivations are discussed in Section 3. 

Dark matter caustics have very well-defined density profiles, and hence calculable 
gravitational lensing signatures [22]. In this dissertation we derive these signatures in a 
number of specific cases. In the limit of zero velocity dispersion (Sv = 0), the density 
diverges when one approaches a caustic surface on the side that has two extra flows, as 
the inverse square root of the distance to the surface. This divergence is cut off if there 
is velocity dispersion, because the location of the caustic surface gets smeared over some 
distance Sx. For the dark matter caustics in galactic halos, 5x and 5v are related [18] by 

5x ~ , (14) 

v 
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Figure 3: Cross-section of a caustic ring in the case of axial and reflection symmetry, and 
p, q < a. 

where v is the order of magnitude of the velocity of the particles in the flow and R is the 
distance scale over which that flow turns around (i.e., changes its direction). For a galaxy 
like our own, v = 500 km/s and R = 200 kpc are typical orders of magnitude. 

As mentioned earlier, the primordial velocity dispersion of the leading cold dark 
matter candidates is very small. Using the estimates of 5v given above, one finds that 
axion caustics in galactic halos are typically smeared over 



as a result of their primordial velocity dispersion; whereas WIMP caustics are smeared 
over 



V m x J 

It should be kept in mind, however, that a cold dark matter flow may have an effective 
velocity dispersion that is larger than its primordial velocity dispersion. Effective velocity 




(15) 




(16) 
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dispersion occurs when the sheet on which the dark matter particles lie in phase space is 
wrapped up on scales that are small compared to the galaxy as a whole. It is associated 
with the dumpiness of the dark matter falling onto the galaxy. The effective velocity 
dispersion of a flow may vary from point to point, taking larger values where more small 
scale structure has formed; and taking the minimum primordial value where no small scale 
structure has formed. For a coarse-grained observer, the dark matter caustic is smeared 
over 8x given by Eq. 14 where 5v is the effective velocity dispersion of the flow. 

Little is known about the size of the effective velocity dispersion of dark matter flows 
in galactic halos. Sikivie [23], however, interpreted a triangular feature in the IRAS map 
of the Milky Way as the imprint on baryonic matter of the caustic ring of dark matter 
nearest to us. The sharpness of the feature's edges implies an upper limit of 20 pc on the 
distance 5x over which that caustic is smeared; and hence an upper limit of order 50 m/s 
on the effective velocity dispersion of the corresponding flow. 

The gravitational lensing effects of a caustic surface are largest when the line of sight 
is near tangent to the surface, because the contrast in column density is largest there. The 
effects depend on the curvature of the caustic surface at the tangent point in the direction 
of the line of sight: the smaller the curvature, the larger the effects. A caustic is an oriented 
surface because one side has two more flows than the other. We considered three cases 
of gravitational lensing by a smooth caustic surface. In the first case, the line of sight is 
near tangent to a caustic surface that curves toward the side with two extra flows. We 
call such a surface "concave." In the second case, the surface is "convex," (i.e., it curves 
away from the side with two extra flows). In the third case, the caustic surface has zero 
curvature at the tangent point (the radius of curvature is infinite), but the tangent line 
is entirely outside the side with two extra flows. Caustic surfaces may have cusps. The 
outer dark matter caustics of galactic halos are smooth topological spheres, which have no 
singularities, but the inner caustics of galactic halos are closed tubes whose cross-section 
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has three cusps. Therefore, the fourth case we consider has a line of sight near a cusp, and 
parallel to the plane of the cusp. 

Gravitational lensing produces a map of an object surface onto an image surface. 
The magnification is the inverse of the Jacobian of this map. Because dark matter caustics 
have well-defined density profiles, it is a neat mathematical exercise to calculate their 
gravitational lensing characteristics. The images of extended sources may show distortions 
that can be unambiguously attributed to lensing by dark matter caustics in the limit of 
perfect observations. We see that in three of the cases considered, a point source can have 
multiple images. In those cases when two images merge, the Jacobian of the map vanishes 
and the magnification diverges. So, at least in theory, it seems that gravitational lensing is 
a very attractive tool for investigating dark matter caustics. Observation of the calculated 
lensing signatures would give direct evidence for caustics and CDM. 

We have been particularly motivated by the possibility [22] that the observer might be 
able to distinguish between axions and WIMPs by determining the distance over which the 
caustics are smeared. The nearby caustic, whose position is revealed [23] by a triangular 
feature in the IRAS map of the Milky Way plane, is only 1 kpc away from us in the 
direction of observation. By observing the gravitational lensing due to that caustic, one 
may be able to measure 5x as small as 10 13 cm, assuming an angular resolution of 3 • 1CT 9 
radians. If 5x turned out to be that small, the WIMP dark matter hypothesis would be 
severely challenged (Eq. 16). 

Unfortunately, as shown below, the gravitational lensing due to a caustic only a kpc 
away from us is too weak to be observed with current instruments. It is well known 
that gravitational lensing effects are proportional to Dj ^ LS where Ds,Dl and Dls are 
respectively the distances from the observer to the source, from the observer to the lens, 
and from the lens to the source. We see below that, for the gravitational lensing effects of 
dark matter caustics to be observable using the current technology, the lenses and sources 
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must be as far away as possible (at the cosmological distances of order Gpc). Even then, 
the observation of such effects will be difficult. Unfortunately, at Gpc distances it is not 
possible to measure Sx as small as Eqs. 15 and 16 with foreseeable technology. So, it seems 
unlikely that one will be able to distinguish between dark matter candidates on the basis 
of the gravitational lensing characteristics of the caustics they form. Henceforth, unless 
otherwise stated, the velocity dispersion is set equal to zero. 

The outline of this dissertation is as follows. Section 2 is devoted to the review of 
the main CDM candidates. In Sections 2.1 and 2.2 we study the WIMPs and axions, 
respectively. Each candidate's status in particle physics and production mechanism in 
the early universe are discussed. Calculations show that their velocity dispersions are 
negligible. In Section 3, we study the caustics associated with the infall of collisionless 
dark matter particles onto a galaxy. In Section 3.1 a general discussion of caustics is given. 
Caustic surfaces are introduced in Section 3.2. We prove the existence of CDM caustics 
and classify them as outer and inner. We show that the outer caustics are topological 
spheres and the inner caustics are rings with three cusps. We find n + 2 flows (where n is 
always odd) inside an outer caustic; and n-flows outside. Estimates of the radii and fold 
coefficients of the outer caustics (based on the self similar infall model) are given in Section 
3.4.2. We find n + 4 flows inside the caustic rings; and n + 2 outside. In Section 3.5.2, 
we give a detailed analysis of the caustic rings under the additional assumptions that the 
flow is axially and reflection symmetric; and that the transverse dimensions of the ring, p 
and q, are small compared to the ring radius a. In Section 3.5.3, we show that under these 
assumptions, the flow near the ring is described in terms of five parameters: a, b, To, u and 
s. We study the differential geometry of the ring surfaces produced by this flow in Section 
3.5.4. We conclude the section by estimating the density profiles of the caustic rings, using 
the self-similar infall model. 

In Section 4.1, the general formalism of gravitational lensing is reviewed. We show 
how the calculations can be streamlined for the case of gravitational lensing by dark matter 
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caustics. In Section 4.2, we calculate the gravitational lensing properties of dark matter 
caustics in the four cases mentioned above. In Section 5, we summarize our conclusions. 
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2 Cold Dark Matter Candidates 

2.1 Weakly Interacting Massive Particles (WIMPs) 

WIMPs is an acronym for weakly interacting massive particles. In this section we derive 
some properties of WIMPs, in particular their velocity dispersion, 5 X = (v%), where 
( ) indicates averaging, and x denotes a WIMP. First, we briefly introduce some relevant 
concepts and calculational tools[14, 24, 25, 26]. 

The key to understanding thermodynamics in an expanding universe is the comparison 
of the particle interaction rates and the expansion rate of the universe [14]. The interaction 
rate per particle of a species is defined as 

r int (0 =n(a int \v\)(t) (17) 

where n(t) is the number density of target particles, v is the relative velocity, and a- mt is the 
interaction cross-section. Since cr int is a function of energy and interacting particles have 
random thermal velocities, averaging of the combination 0-^1^1 is necessary. The expansion 
rate of the universe is given by H(t) = R(t)/R(t), where R(t) is the scale factor of the 
comoving coordinates. Since in the relativistic regime, n(t) scales as R~ 3 (t), and (cri nt v) is 
typically a declining function of energy, r int (t) decreases at least as fast as R~ 3 ~ t~ 3 / 2 . In 
the non-relativistic regime, n(t) and hence r int (i) decreases exponentially. The H(t), on 
the other hand, changes in time as t~ l in both regimes. Thus, T int (t)/ H(t) — > oo, as t — > 0. 
In time, crossover between r int (t) and H{t) is unavoidable. This means that there is a 
certain time t D) called decoupling time, such that Tint^D) — H(t D ). After t D , a particle 
species effectively stops interacting with the rest of the matter; and its distribution evolves 
independently. 

Next let us consider WIMPs in particular. We assume that they carry a quantum 
number, like R parity in the case of the lightest supersymmetric partner (LSP), that keeps 
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them stable. Thus, the number of WIMPs per comoving volume can only change via 
annihilation processes mediated by Z°, such as the two-body final state annihilations: 



where X denotes particle species (like u, e~, n~ , u, d, s, etc.). A bar indicates the anti- 
particle. We represent the average annihilation cross-section summed over all annihilation 
channels (not only the two-body final state annihilations) by (<7 ami \v\)- Let us also assume 
that there is no particle number asymmetry between x's and x's, or that it is negligible. 
When T drops below m x , the x particles become nonrelativistic and their energy density 
p x ~ rn x n x * s Boltzmann suppressed as long as it is in thermal equilibrium. Their number 
density is given by 



where T(t) is the temperature and g x is the number of internal degrees of freedom of the 
x's (such as spin, color, etc.). The number of internal degrees of freedom g, L of a species, 
in general, is an important quantity in the calculations (because each adds independently 
to the number and energy densities, pressure, etc.). For example, photons have two polar- 
ization states, hence g 1 = 2; neutrinos have only one polarization state, therefore g v = 1; 
and electrons and muons have g e -,^ = 2, since they have two spin states. Internal degrees 
of freedom of the antiparticles are counted independently; and their number is the same 
as for the particles. It is clear from Eq. 19 that if a massive species remained in thermal 
equilibrium until the present, its abundance would be negligible. On the other hand, if 
the annihilations of the species stop at a temperature T F = T(tp), called the freeze-out 
temperature, n x would decrease only as R~ 3 after ip, exceeding its equilibrium value. Such 
particles could have a significant relic abundance even today. 

Using Eqs. 17 and 19, the annihilation rate of x can be written as 



xx^xx 



(18) 




(19) 




(20) 



20 



The average of (Tannic for the annihilation reactions is given [14] as 



\CTann\v\) = <T ( — ) , (21) 



(where n = for s-wave annihilation, n — 1 for p-wave annihilation, etc.). The <7o for a 
WIMP with mass in the range T < m x < m z o is 

a ~ ^G|m x , (22) 

where GV is the Fermi constant. The value c depends on the number of annihilation 
channels and whether x is distinct from x (Dirac-type) or not (Majorana-type). Eqs. 
20-22 yield 

r ann (T) * (^) V2 " n T 2 m x exp (-m x /T) . (23) 

The Hubble parameter H(T) can be determined from the Friedmann equation for the 
early universe: 

H 2 (t) = S -f-p(t) , (24) 

where G is the Newton's constant and p(t) is the total energy density. Since the energy 
density of a nonrelativistic species in thermal equilibrium is exponentially suppressed com- 
pared to that of a relativistic species (assuming for the moment that no other species is out 
of equilibrium), summing only the energy densities of the relativistic particles in thermal 
equilibrium at a given temperature is sufficient. Hence, the energy density has the form of 
the Stefan-Boltzmann law: 

p(T) ~ ^g cS (T) T 4 . (25) 

Here we introduce the effective degeneracy factor g eS (T), which counts the total 
number of internal degrees of freedom of the particles that are relativistic and in thermal 
equilibrium at temperature T (particle species whose mass m; T). The expression for 
g e s(T) also contains the factor 7/8 for fermions relative to bosons. It is useful to calculate 
g e s(T) for a temperature (say, 1 TeV) at which all the particles of the Standard Model were 
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relativistic and in thermal equilibrium. The total number of internal degrees of freedom of 
the gauge and Higgs bosons is 28 (the Higgs boson is spinless, hence it has just one spin 
state. The W ± and Z° bosons are massive spin s = 1 particles, hence each have 2s + 1 = 3 
spin states. The photon and each of the eight gluons are massless spin 1 particles, therefore 
each has 2 helicity states: +1 and —1. If we add them up, the total number of bosonic 
degrees of freedom is 9-2 + 3- 3 + 1 = 28) and for fermions it is 90. (Each of the six quarks 
comes in 3 colors and 2 spins. Each of the three charged leptons has two spin states. 
Each of the three neutrinos has one helicity state. Therefore, the total fermionic particle 
degeneracy is 6-6 + 3- 2 + 3-1 = 45. Because of the anti-particles, the total fermionic 
degeneracy doubles, hence 2 • 45 = 90). Thus, 

g cS (T = ITeV) = 28 + - • 90 = 106.75 . (26) 

8 

For some species of particles, as happens for neutrinos, the interaction rate becomes 
smaller than the expansion rate. Those particles will have lower equilibrium temperatures 
than the photons (see below), but will still remain relativistic. To allow this possibility, 
we introduce a specific temperature Tj for each kind of relativistic particle and define the 
total number of effective degrees of freedom as 

wt)= e e ■ (27) 

i=boson ^ ' j'=fcrmion ' 

A useful formula can immediately be obtained by inserting Eq. 25 into the Friedmann 
equation (Eq. 24) 

tf 2 = ^-S cff T 4 = 2.76^P (28) 

3 30 y m P1 y 1 

where G is given as the inverse square of the Planck mass = rapi, or 

# = — v^T 2 . (29) 
mpi 
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Since the scale factor R(t) ~ y/i in the era of radiation domination, H(t) = R/R = l/(2t). 
Thus, we find the time-temperature relation in the radiation domination: 

t = 0.3-^_ . (30) 

This formula is valid at temperatures around 1 MeV, where most of nucleosythesis and 
neutrino decoupling occurred. 

Annihilations effectively stop at temperature T F = T(t F ) when the ratio of the rates 
T(T F )/H(T F ) ~ 1. Then, Eqs. 23-25 yield 

r(Tp) g y fm x T\ l/2 ~ n , 

tttt^t - ,\„ A ^- ™v exp (-m Y T) ~ 1 . 31 

If the x's are Dirac-type particles, c ~ 5 and n = 0. Taking g x = 2 and g cff ~ 60, the 
logarithm of Eq. 31 yields 



T, 



F 



In this equation, to leading order, m x /Tp ~ 16.4. Iterating in this order, the logarithmic 
term corrects the result as 

^ 17 . 8 + 31n (^). (33) 

Although n x R 3 freezes out at T F ~ m x /20, the energy distribution of the WIMPs is kept 
thermalized by collisions with v, e,etc. (e.g. , z/ + x ~^ v + x)> down to the decoupling 
temperature T D ~ MeV where all the WIMP interactions effectively stop and WIMPs 
stream freely. Neutrino decoupling takes place also at this temperature. To see that 
the decoupling temperature of neutrinos is indeed around an MeV, consider, for example, 
the interactions between relativistic charged leptons and neutrinos. A typical process 
maintaining thermal equilibrium, such as v~ + e + — > + /i + mediated by W + , has a weak 
interaction cross-section: 

a 2 s a 2 s 
(s - m l w y m^y 
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where s is the square of the total four momenta of the incoming or outgoing particles 
(which is also equal to the square of the total energy in the center of momentum frame), 
m w is the mass of the W + boson and a = 1/137. Since s is of order the energy squared 
of the reacting particles, and average energy is proportional to T, the cross-section can be 
expressed as 

a 2 T 2 

Cweak ~ 4— • (35) 

The interaction rate T wea k = n{a we3 ^\v\) is thus 



a 



2^5 



1 weak ~ 4— , (36) 

where we have used \v\ — c — 1 and the number density for a relativistic fermion species 
rti — I ^^#jT 3 j ~ T 3 . Comparing the above rate with the Hubble expansion rate 



H ~ T 2 /mp\ (using Eq. 29), we have 

T wcak a 2 T 3 m P i 



H 



(37) 



The decoupling occurs when the above ratio drops below unity, implying 

m w X 



■ D 



a 2 mp\ 



) 



3 



4 MeV . (38) 



Turning back to the case of WIMPs, being non- relativistic at t D , the x s satisfy 

1 3 

-m x (v 2 x ) TD = -T D . (39) 



Therefore the velocity dispersion of the WIMPs at to is 



3711 (40) 



For any free particle moving in an expanding universe, momentum decreases in inverse 
proportion to the scale factor: p(T) = p(T D ) (R(T D ) / R(T)) . The velocity dispersion of the 
WIMPs as a function of time is therefore 

^=ilS' (4l) 
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The ratio R{t D ) / R{t) can be determined using conservation of entropy, which implies 
T ~ 1/R (see below). For a numerical estimate of the ratio today, depending on the 
decoupling era of x s , we can either use the present temperature of the CMBR photons 
Tq 7 or the temperature of the relic neutrinos T 0u . The latter can be calculated from 
T 07 . The difference between T 07 and T 0u is due to the early decoupling of neutrinos and 
the subsequent annihilation of electron-positron pairs: e~e + — > 77 near T = 1 MeV. 
To understand this so called "reheating" effect of photons, we recall that the expansion 
of the universe is adiabatic, and hence the entropy of particles in thermal equilibrium 
S(T) = R 3 (T) (p(T) +p(T)) /T is conserved. Using the relativistic expressions for the 
energy density p(T) = n 2 g cS T A / 30 and the pressure p(T) = p(T)/3, the entropy can be 
written as S(T) = 27r 2 g* eS (R(T)T) 3 /45. Here we define 



is a conserved quantity. After decoupling, neutrinos move freely. They remain in a thermal 
Fermi-Dirac distribution with a temperature T u = T D R(T D ) / R(T) = KR~ l (T) provided 
T v 3> m U) and their entropy S V (T) is separately conserved. If we consider the rest of 
the particles in thermal equilibrium, using Eq. 43, we calculate their temperature as 
T = (C/g* ff ) 1//3 .R _1 (T). Since at the time of decoupling, photons, neutrinos, and the 
rest of the matter had the same temperature, (C / g* s (T D )Y^ 3 = K. Thus, in spite of 
the decoupling of neutrinos, as long as g C Q does not change they all continue to have 
the same temperature. However g e s changes when the temperature of the universe falls 
below T ~ m e . Below this temperature, the mean energy of photons is not sufficient to 
create e _ e + pairs. Hence annihilations e~e + — > 77 exhaust the e~e + pairs. Therefore, for 
T D > T > m e , photons (g 7 = 2) are in equilibrium with electrons (g e - = 2) and positrons 




(42) 



Thus, 



g: s (R(T)T) 3 = C 



(43) 
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(g e + = 2) giving g* s = 2 + (7/8) • 4 = 11/2. For T <C m e , however, only photons are 
relativistic, giving g* s = 2. If we apply the conservation of entropy Eq. 43 to the particles 
which are in equilibrium with radiation before and after e~e + annihilation, we find 

(44) 



\3 



(i?(T)T 7 )^ me (g* eS )T« me 4 
Just before e~e + annihilation, neutrinos (although they were decoupled) had the same 
temperature as photons: 

(R(T)T„) T > me = (i?(T)T 7 ) T > me = K . (45) 
Using the above equation in Eq. 44, we obtain 



(R(T)T,) T<<me = - {R{T)T u ) T> _ me 



iiy/s 

4 J 



T J (i?(T)T,) T<<me . (46) 

In the last equation the constancy of (R(T)T U ) is used. After e~e + annihilation, g* ff does 

not change any more. Therefore, the relation 

4 \ V3 



^ = VIT J T ^ (47) 

remains valid even today. 

Now, let's turn back to the estimation of the velocity dispersion given in Eq. 41. If 
Xs decouple before T > m e , they are not affected by the reheating. Therefore, 



^x = W^S i - (48) 

Let us estimate the present value of the above 5v x numerically. We take T D ~ lMeV and 
m x ~ lOOGeV. Since the present value of T 7 = 2.725K = 2.348 • l(T 13 GeV, using Eq. 47 
we find T v = 1.945K = 1.676 • lCT 13 GeV. Thus, from Eq. 48, we obtain the present value 
of the velocity dispersion as 



. . 19 lGeV /lMeV , . 

5v x (t Q ) = 9.180 • 10- 12 J , (49) 
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where t denotes the present age of the universe. To insert the time dependence of S x back, 
using Eq. 41, we write 



3T D R(t D ) _ 3T D R(t D ) R(t ) 



Because y ^~ ^r^) i s the present value of the velocity dispersion Sv x (t ) estimated in Eq. 
49 and R(t) ~ t» in the era of matter domination, we have 



c / x c / sR(to) «■ , xA>V 12 /lGeV /lMeV/t V 



. 1(r n^y^y(|, . (51) 



If on the other hand, x s decouple after T ~ m e , they benefit from the reheating 
caused by e + e~ annihilation, and 



Then, in this case, 



which is not very different from the value obtained in Eq. 51. 

The present average WIMP number density n x (t ) can be expressed in terms of Q x = 
p X [t°) ; which is the present WIMP energy density in units of the critical density p c : 

n x (t ) = PJ^ = . (54 ) 
m x m x 

Recall that the Friedmann equation for a flat universe is 

H\t Q ) = ^p c (to) , (55) 
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where H(t ) = -|j|jjy is the present Hubble expansion rate of the universe. Therefore the 
present critical density is 

p c (t ) = 8.1 • Kr n /i 2 (eV) 4 = 4.85 • lO 82 ^^ , (56) 

where we used H(t ) = h 100^™^ = h 3.34 • lCT 10 ^. Using Eqs. 54, 56 and the conversion 



eV = 1.564 • 10 23 ^ we find 

pc 



. , 1.5 1 50 „ / h \ 2 /GeV\ 



We next discuss another leading species of cold dark matter candidates, namely ax- 
ions. 

2.2 Axions 

Axions have been postulated to solve the strong CP problem of Quantum Chromodynamics 
(QCD). QCD is the theory of strong interactions. It is a gauge theory on the gauge group 
SU(3) of "color." Its dynamical variables are a color octet of gauge fields A a , called gluons, 
and a family of color triplet spinor fields qf, called quarks. The flavour index /, labels the 
various triplets. A® is analogous to the vector potential of electrodynamics. Here, in 
addition to the spacetime index /i, the gauge field carries the SU (3) group index a. In an 
analogy with Quantum Electrodynamics (QED), the action for the gauge sector is at first 
taken to be 



r 

-"-gauge 



~ J d'xG^G^ (58) 



where 

G% = d,Al - d v Al + 9 r hc A\Al (59) 

is the gluon field strength tensor, g is the gauge coupling, and f abc are the structure 
constants of the SU{3) algebra: [T a ,T b ] = if abc T c . The analogy between QCD and QED, 
however, fails where the vacuum structure of the theories is concerned. In both theories, the 
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vacuum configuration of the theory is characterized by the vanishing of the field strength 
tensor F^. In QED, however, all the vector potentials which yield F^ u = 0, are related to 
each other by a gauge transformation. For any vacuum configuration, can be made to 
vanish by a gauge transformation; see the Appendix. Thus the QED vacuum is unique. 
In QCD, on the other hand, not all solutions of G a = are related to each other by a 
gauge transformation. These physically distinct vacuum states can be classified by the 
set of integers, n, and hence the most general ground state |6), called 9- vacuum, can be 
expressed as a superposition of all the degenerate states \n): 

oo 

|0) = exp(-m0)|n) , (60) 

n=— oo 

where the extra parameter 6 has period 2n, and needs to be measured; see the Appendix. 
The effects of the 0-vacuum can be included in the Lagrangian density of QCD by the 
addition of a topological term 

£e = ~^ G ^ G ^ = d " K " ' (61) 



where (Eq. 480) 



= -e^ pc ^{A v d p A a - i-^A v A p A a ) . (62) 



Such a term affects neither the equations of motion nor the perturbative aspects of the 
theory (Feynman rule for the vertex associated with a four- divergence would necessarily 
have a multiplicative factor of the sum of the momenta at the vertex which vanishes 
automatically due to momentum conservation). Therefore, the existence of the 0-term in 
a Lagrangian does not necessarily mean that physics depends on it. As remarked in the 
above, Classical Electrodynamics and perturbative QED are unaffected by the presence 
of such a term in their Lagrangians. In QED, if there are non-perturbative effects, they 
are absolutely negligible. This is because, the essential structure of a non-perturbative 
effect can be represented by e~~ , which vanishes as the coupling constant a e — > (when 
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the interaction is turned-off) and can not be expanded in a Taylor series in a e . Since 
a e = 1/137, the ©-dependence would be highly suppressed. Non-perturbative effects, 
however, produce 6-dependence in QCD. These effects must be present in the theory, 
otherwise QCD would not produce the phenomenology at low energies. 

To illustrate the claim given in Eq. 61, we evaluate the vacuum to vacuum transition 
amplitude in the SU (3) gauge theory: 

(eV^ie) = ^2e ime 'e lse (m\e- lHt \s) . (63) 

m,s 

Recall that the expectation value (m\e~ lHt \s) can be written as the path integral over all 
\Af~\ that connects the mth vacuum with the sth vacuum: 

L f-J s—m 

( m | e -^| s ) = J [DA^\ s _ m exp(-i J C„d A x) . (64) 
Then Eq. 63 becomes 

(e'|e-^|0> = ^ e -(-™)e e -(e'-e) j f [DA ,] s _ m e W (-i [ £ gmge d*x) 

s,m 

= 5(9-9')^e me f [M^e"^^^, (65) 

n J 

where the delta function is obtained by summing over m, after s was replaced by n = s — m. 
The phase factor e m6) can be absorbed into the action using 

2 r 

n = / d 4 xG apu *G a uu , (66) 

32tt 2 J K ' 

where g is the strong coupling constant and *G apu = \e^ u \ p G aXp is the dual field strength 

tensor. Therefore, the effective QCD Lagrange density is the sum of 

C = -nQ = J d'xG^G^, (67) 

£gau g e and the Dirac Lagrangian density for the quark fields: 

C-QCD = £ [q/iPq, ~ (m f ql f q Rf + h.c.)] - l -G a ^G% - |^G^*G^ , (68) 
/ 

30 



where we define 7^ = 7^ (<9 M - ^T a ), g L/ ee (l/2)(l- 75 ) g/ and to/ = (1/2)(1 + 
7 5 )gj. The scale dependence of the strong coupling constant g is given at one loop by 

tt -<"> = — (11 -§„,),» (69 » 

where n/ is the number of quark flavors with mass less \i and A/ is the appropriate QCD 
scale. The rrif denote the quark masses. They originate in the electroweak sector of the 
Standard Model which must violate CP to explain K L — > 2n. Hence, rrif are complex 
numbers in general. 

In the Standard Model, the 9 term receives a contribution due to electroweak effects 
involving the rrif. To see the reason, consider the effect in the path integral 

Z[V, fj, 3,] = f [Dq] [Dq] [DA] exp (i J d 4 x [C QCD + q V + fjq + 3fM A^ , (70) 

of a redefinition of all the quark fields 

q f -> = exp (i/3fy 5 )q f , (71) 

where aj are a set of real phases. Recall that {7^,75} = 0, 75 = 75 = 7^ and 757^ = 
-7^75. Therefore, 

ft- / 1>. 

n=0 n=0 

= 7^exp(-i/3/(a;)75) , (72) 

hence, 

q' f ( x ) = q'\(x)7° = (e^ (a;)75 g / (x)) t 7 = g/(^)e-^ (:r) ^7 

= g}(x)7°e^ (a;)75 = q f {x)e ip f {x) ^ . (73) 

The transformations of the terms in the exponent of Eq. 70 can then be calculated easily: 

q'fi^q'f = q-fjM^H^d^M^qf (74) 
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= q f jPM^i lix [(id^ f (x) l5 )e^f {xh5 q f + e^^d^f] (75) 

= -(PPf(x))qfe*'l x >»we*' ix) ™<lf + = -(d^f(x))q fW q f + q f ifiq f , (76) 

and 

q'f^q'f = q n ,e-^ x ^A^ x ^q f = q fl ^q f . (77) 
The mass term in Cqcd is 

Anass = ~ ^771/9/(1 + 75) g/ - ^ ^m)q f {\ - 75)9/ . (78) 

/ / 

Since the chiral transformation gives 

m f q f q' f = m f q f e i2 ^ (xh5 q f = m f [cos {Wf)Wf + i sin (2f3 f )q n5 q f } , (79) 

mfq'flsq'f = m f q' fl5 e i2f3 f {xh5 q f = m f [cos (2(3 f )q n5 q f + i sin (2(3 f )q f q f \ , (80) 
the £ maS s transforms to 

Cass = -\j2 m 'f^^ + 75 ^/ ~ ^ X™'/^ 1 - 7s)?/ , (81) 
/ / 

where 

m' f = m f e l2l3 i (x) , (82) 
m'* = m* e -* 2 W . (83) 
Finally, the measure for the path integral over quark fields transforms [27] as 

[dq] [dq] - exp ( ^ J d'xG^G^ £ 2/3/ J [dg] [dg] . (84) 



/ / 



Hence: 



Z = Z[rf, ff, ^ = J [dq] [dq] [DA] exp (i J d 4 x [C m + qrf + ry'g + J^l ) , (85) 
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where, in the source terms, we shifted the chiral transformation from the fields to the 
sources: 

rj' f {x) = e^ {xh5 ri(x) , (86) 

fj' f (x) = f](xy p i^ , (87) 



and 



i (0 + 2E f /?/k 2 

"4 G G ^ 32^ G 

Using Noether's theorem, which relates the divergence of the current associated with an in- 
finitesimal continuous transformation on the fields to the change in the Lagrangian density 
under that transformation, we calculate the current associated with the chiral transforma- 
tions: 

q' f ( x ) = (1 + ip f (x) l5 )q f (x) + 0((3%x)) , (89) 
q' f (x) = q(x)(l + i(3 f {x) lb ) + 0(p 2 f (x)) . (90) 
The infinitesimal form of Eq. 88 is 

C m = Cqcd - Yl^Mvn^qf + i2(5 f {m fq y R f + h.c.) + M G <w* G y . (91) 
The current associated with the transformation (Eq. 90) is 

Noether's theorem states 

= - E id = E '2('»/9l/««/ + h-c) + J^G^-G^ . (93) 
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The explicit dependence of Cp on f3f(x) in Eq. 90 yields the non-conserved axial vector 
current 

« = ^QT 9/7m76?/) = J2^ m A^f + h - c -) + 4~l Ga ^ G U ■ ( 94 ) 
/ / 71 

The first term on the right hand side is due to the fermion mass, whereas the second is 

due to the anomaly which is a quantum mechanical effect (in fact the anomaly term is 

proportional to g 2 K). 

Equation 88 implies [28] that the physics of the theory defined by Cqcd is unchanged 
under the global (spacetime independent) transformations: 

q f -> e i/3 ^'q f 

rrif -> e~ i2/3f m f 

m* f -> e i2f3f m* f 

-> 0-2^. (95) 
/ 

In general, this set of transformations is not a symmetry of the theory because, when 
the dynamical variables qf are transformed, the parameters, rrif and 0, are needed to 
be transformed, to leave the Lagrangian invariant. If the physics were independent 
(transforming would not make any difference), then one would recover the classical 
result that the theory has an axial Ua{1) symmetry when rrif = (rrif transformations 
would not exist). We know, however, that {7a(1) is explicitly broken because otherwise 
there would be a fourth pseudo Nambu-Goldstone boson rj', in addition to the three pions, 
with m v > < v^3^tt- This can only be understood if physics is ©-dependent. In this case, Eq. 
95 is not a symmetry even in the massless limit, because the parameter is transformed. 
Hence, Ua(1) is broken even when the quark masses are zero. (If there is ©-dependence 
dfjJt* 7^ 0, due to the G aMiy *G^-term, even if rrif = 0. Hence the anomaly explicitly 
breaks L0i(l)). Therefore, QCD must have ©-dependence through non-perturbative effects 
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(as noted earlier, the 9-term is a four-divergence and hence does not contribute in the 
perturbative quantum theory). These effects that break C/a(1) are known as QCD instanton 
effects; see the Appendix. The instanton in QCD describes a classically forbidden, but 



associated with C/a(1) symmetry is not conserved. Therefore, the instantons explicitly 
violate C/a(1). The probability amplitude of an instanton event (see the Appendix) 



where // is the inverse of the instanton size and a s is defined in Eq. 69. This amplitude can 
not be expanded in a Taylor series in a s . Thus, the instanton is indeed a non-perturbative 
effect. At a finite T, using Eq. 69, we have 



At high temperatures non-perturbative (instanton) effects are suppressed. Around T ~ 
Aqcd, instanton effects fully turn-on and strongly break C/a(1) symmetry of QCD. 

The terms iqp^qf and G afiU *G'j 1L/ are P and CP odd. Hence their presence in the theory 
(in the £ mass and Cq, respectively) leads in general to P and CP violation. Moreover, the 
transformations given in Eq. 95 allow us to change the rrif (for each flavor) and 0. The 
apparent source of CP violation can be moved among the m/ and 6, by changing the 
phases of the quark masses. Since there are six transformations on seven parameters, we 
can not find a definition of quark fields that sets all the parameters real. We might choose, 
for example, the 6 as zero, m u , . . ., m b as real and m t as complex. One combination 
that can not be removed from the theory is — Arg Y[f m f = @ — 2 J2f Pf- Hence, the 6 
dependence of QCD comes only through the combination of parameters 



quantum mechanically allowed vacuum tunneling, in which the charge Q 5 ^ = J d 3 xq 



775?/ 




(96) 




(97) 




(98) 



/ 



/ 
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Let us also note that the transformations given in Eq. 95 leaves the invariant. For 
example lets transform the up-quark field only, then for the invariance, we have 

Hu T " Hu 

Arg(m u ) ->• Arg(m u ) - 2f3 u 

-> B-2(3 U , (99) 

and hence 

0^0. (100) 

Let us suppose that 0^0. Then, no matter how we shuffle the phases of quark masses, it 
is impossible to set = 0, and Arg(mj) = 0, V/. Therefore, it is inevitable to have some 
CP violation, with an upper limit consistent with the experiments. 

As we remarked, when 0^0, the term in Cqcd violates the P and T (and hence 
CP) symmetries. Since 

P : Ef -> -El B a t -> Bf (101) 

and 

T: E% -> Ef Bf -> -S? , (102) 

G a^* G a^ = -4E?B? is odd under P and T. Therefore, the 0-term would induce a CP 
violating term in the effective 7T-N coupling and produce an electric dipole moment for 
the neutron. It is of order [29, 30, 31, 32, 33, 34]d n ~ 5 ■ lO~ 16 |0|e • cm. Comparison 
with the experimental bound d n < 10~ 25 e • cm constrains [35] < 10 -10 . This fine tuning 
requirement on the parameter is known as "the strong CP problem." It is puzzling that 
the two independent contributions in the 0, due to the vacuum structure of QCD, and 
2 YlfPf due to the electroweak effects involving quark masses, seem to cancel each other. 

The most elegant solution of the strong CP puzzle is to replace the parameter by a 
dynamical field 0(x). This is achieved by introducing a Higgs field whose expectation value 
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v pq spontaneously breaks a postulated global U{\)pq Peccei-Quinn (PQ) Symmetry [36, 
37]. The axion is the Nambu-Goldstone boson associated with this spontaneous symmetry 
breaking. The defining properties of PQ symmetry are [28]: 

1) it is an exact global symmetry of the classical theory, before anomalies, 

2) it is broken spontaneously, 

3) it is broken explicitly by the QCD instanton effects which make the theory dependent. 
To see how the PQ mechanism works, consider the theory defined by 



: Q cd + pq = ~G%G^ + \d^4> - ^G%G a ^ 



+ - (K fq y Rf 4> + h.c.)] - V{4>*4>) , (103) 



/ 

where is a complex scalar field. Under U(1)pq, — > e ia (p and q f -> e ia ^' 2 q f . The 
potential V is a "Mexican hat" potential; hence, U(1)pq is spontaneously broken: 

(<j>(x)) = v PQ e ia ^ . (104) 

The quarks acquire masses: m/ = KfVpQe ia ^ x \ and hence 

B(x) = e-Arg([[m f ) = e-Arg(l[K f )-J2^(x) = e-Aig([[K f )-Na(x) . (105) 
/ / / / 

In this toy model N is the number of quarks. Notice that here 6(x) is a function of the 

dynamical field a(x), whereas in Eq. 95 is fixed. One can adopt a convention such that 

— Arg(Y[f m f) = 0. In Eq. 103 the non-perturbative effects which make the theory 

©-dependent produce an effective potential for 0. The field relaxes to the minimum 

of that potential. Therefore the axion is not massless but acquires a small mass due to 

non-perturbative QCD (instanton) effects. The zero temperature mass is given in terms of 

up and down quark masses m u and m d , and the pion mass m n [38, 39] as 

m a = 7— , 106 

m u + m d } a 
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where f a = -j^- and f n are the axion and pion decay constants, and N is a positive integer 
that describes the color anomaly of Upq{1). (Axion models have N degenerate vacua [40]). 
Numerically 

mo . 6 . eV (^X). (107) 

We now study the cosmological implications of axion models [41, 42, 43]. Let the 
potential for <fi(x) in Eq. 103 be 

At extremely high temperatures Upq(1) symmetry is restored, (<f>(x)) = 0. As the universe 
expands and cools below Tpq ~ vpq, the potential looks like a "Mexican hat" and is 
minimized for 

= v P Q exp (ia(x)) . (109) 

Notice that V(<f>) is independent of a(x). This massless degree of freedom is the axion: 
a(x) = vpq(x{x). The system settles down in one of the continuously degenerate ground 
states which are all equally likely. At much lower temperatures T ~ ^qcd, axions acquire 
mass. This is because U(1)pq is explicitly broken by non-perturbative QCD (instanton) 
effects at low temperatures. At high temperatures these effects are suppressed. When T 
approaches Aqcd, instanton effects turn on. They produce the effective potential 



v 



2 

PQ , 



ne) = <CO^(i-cos(e)), (no) 



where [44] 



m a (T) ~ 0.1m. ( ^ ) • (Ill) 



3.7 



The minimum of the total potential is at the CP conserving value Q(x) = Na(x) = 0. 
The axion develops mass m a , due to the curvature of the potential at the minimum. The 
effective Lagrangian density for the axion field is obtained by setting a(x) = 



Cos = ^d.ad^a - m 2 a {T)^_{l - cos (— )) , (112) 
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where we implicitly assumed that the signature of the spacetime is (H ). Variation 

of the action S = J d 4 Xy/^gC cS yields the sine-Gordon equation for a(x): 

Ua{x) + ^ml(T) sin (Na(x)) = , (113) 

where d'Alambertian □ = g)d fl (y/— gg^ v d v ). In a Friedmann universe where 

ds 2 = dt 2 - R 2 (t)dx ■ dx (114) 

Eq. 113 reads 

a + 3H(t)a - J^^ 2 ® + Jj m l ( T (^)) sin ( Na ) = • ( 115 ) 

Near the minima, the potential (Eq. 110) V(a) ~ \m 2 a v 2 P Q(x L . Hence, sin [Not) ~ Na in 
Eq. 115. In that case, the solution of Eq. 115 is a linear superposition of eigenmodes with 

— * 

definite comoving wave vector k: 

<•<*•'> = /*M£.*y. die) 

where the a(k,t) satisfy 

dt + Z^dt + + "ft = 0. (117) 

We are interested in the k = mode of vacuum misalignment contribution. The other 
contributions (of similar magnitude) are discuss by Chang, Hagmann and Sikivie [45]. 
Then Eq. 115 becomes 

a + 3H{t)a + m 2 a {t)a = . (118) 

This is the equation of a damped harmonic oscillator with time dependent parameters. 
Since the axion is massless when T ~ Tpq, no initial value ot\ of a is preferred dynamically. 
Although a ~ today, there is no reason to expect that was the case initially. Therefore, 
cci is chosen by some stochastic process. For m a = (as is the case for T 3> Aq C d), the 
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solution of Eq. 118 is obtained by inserting the ansatz a = t p to the differential equation 
and solving the resulting algebraic equation p 2 +p/2 = 0. The most general solution is 

a = ai + a 2 r 112 . (119) 

Quickly a approaches a±, which is a constant, and starts to oscillate when T approaches 
the critical temperature 7\ where [14] 

m a (Ti(ti)) ~ 3H (71(40) = J- . (120) 

To be more specific let's define the critical temperature as 

TOaC?l(tO)*l = |. (121) 

We will see below (Eq. 139) that fi a ~ 0.4 when m a ~ 6 • 10~ 6 eV (Eq. 107). In 
that case 7\ is about lGeV (Eq. 128). The typical wavelength of these modes is of the 
horizon size back then. The corresponding momenta of the particle excitations at birth are 
therefore: 

Pa(h) - l/d H = l/ tl = ~ 1.6 • 10- 9 eV l 1 -^^) ™ , (122) 

U.o m P i \ Ja J 

where we used time-temperature relation Eq. 30. Since their mass at that time 

2 

n^a(ti) ~ 2.4 • 10~ 9 eV (10 12 GeV// a ) 5 7 , they were semi-relativistic. Axions, however, gain 
mass rapidly. As the temperature drops to T ~ Aq C d ~ 200MeV, their mass increases 

3.7 

by a factor of 230 • (10 12 GeV// a ) IT , and axions suddenly become non-relativistic. Because 
the axion couplings are small, energy dissipation of the modes is negligible. The coherent 
oscillations produced by the vacuum misalignment are good dark matter candidates. 

The present velocity dispersion of these axions can be calculated using Eq. 122 and 
the conservation of entropy (Eq. 43): 

^(TOTfi^m) = ^ ff (T 07 )T 3 7 i? 3 (T 07 ) , (123) 
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Figure 4: Switch-on of the axion mass in the early universe. 



where g^T,) ~ 60 and g* cS (T 0y ) = 2, 

r / . \ _ /. \ Pa(«l)i2(*l) 1 1 A: ff (T 07 ) ^T 07 

m a i?(t ) rriah R(t ) m a t x \ ^ ff (Ti) / 7\ 

To estimate the velocity dispersion we need to calculate t\ and T\. Using Eqs. 107 and 
111 we find 

mJT) ^,. 10 -* eV (^(2fy . (125) 

Taking g cS ~ 60 in the time-temperature relation (Eq. 30), we recast Eq. 125 to read 

m a (T) ~ 3.1 • 10~ 51 (GeV)^ ^ 10 " GeV ^ t ¥ . (126) 
Then definition 121 yields 

''-^^(lo^v)*- 8 - 1 - 1 ^)"^!^)* ■ (127) 

The corresponding temperature is 

/10 12 GeV\ 

71 ~ 0.9 • GeV I j . (128) 

Inserting Eqs. Ill, 127 and 128, in Eq. 124 we estimate the present value of the axion 
velocity dispersion as 

4.7 

M*o) ~ 1-5 • It)" 17 . (129) 
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Since Sv a (t) = 5v a (t ) (R(t )/R(t)) = Sv a (t ) (t /t)* we have 



m a J \t 

Let us now calculate the energy density of the axion field due to vacuum misalignment. 
The energy density p of a homogeneous scalar field around the minimum of its potential is 

P= V -f[a 2 + ml(t)a>] . (131) 

The Virial Theorem implies 

(a 2 ) = m 2 (a 2 ) = . (132) 
V PQ 

Multiplying Eq. 118 by a and using Eqs. 131-132 we obtain 

'- = --4- < 133 > 

p m R 

whose solution yields 

P = const. (134) 

This equation means that as long as the axion mass varies adiabatically, the number 
of axions per comoving volume is conserved. Because the initial energy in the coherent 
oscillations is p\ = VpQm 2 (t 1 )a 2 /2, it is easy to estimate the present energy density p : 

m a (t ) R 3 (h) 1 2 R 3 (h) 2 

Po = Pi ,. s m(+ \ = ^PQ^it^ma——^ . (135) 

m a {ti) R 6 {t ) 2 v R A {t ) 



Using the conservation of entropy (Eq. 43), we find 

1 T 3 
Po ~ -^vlgmait^maj^-al . (136) 

If the universe never inflated, our presently observable universe should have 10 30 causally 
disconnected domains when the a oscillations started. Therefore it is reasonable to assume 
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that each of the domains had arbitrarily chosen initial amplitudes and take a\ as the rms 
average of the uniform distribution from —n/N to n/N: 

1/2 



(«l)r 



N 
2n~ 



(137) 



V3N ' 

If, on the other hand, the universe inflated after the PQ symmetry breaking, then our 
observable universe originated from a single domain. Although cei had the same value 
everywhere in our universe, it is arbitrary. Any value is equally likely. To determine 0i in 
this case, measurement of the axion mass and density are necessary. Assuming a ~ ^7§^m 
and using Eqs. 107, 125 and 128 in Eq. 136, we find 
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-30 S r ( fa X ' 7 



"°~ 35 ' 10 • (138) 

Dividing by the critical density p c = we obtain 

fi »~ - 4 '(lo4vf(¥) 2 - < 139 > 

where h is defined as usual by H = h lOOkm/sec • Mpc. 

Finally, following the same steps between Eqs. 54 and 57, we express the present 
axion number density n a in terms of Q a , the present axion energy density in units of the 
critical density: 

, , 1.5 10 6 ^ f h\ 2 /10- 5 eV\ ,, AnS 

"•(*-) = -S3" (or) (-ST") ' (140) 

As can be seen from the large exponents in Eqs. 57 and 140, WIMPs and axions 
exist in enormous numbers on astronomical length scales. Moreover, they have negligible 
velocity dispersion 5v x and 5v a implied by the exponents in Eqs. 51 and 130. Because of 
their small velocity dispersions, they are cold dark matter (CDM) particles. CDM particles 
are so weakly interacting (collisionless) that they have moved purely under the influence 
of gravity since they decoupled. Galaxies are surrounded by unseen CDM, and hence, 
because of gravity, CDM particles keep falling in and out of galaxies from all directions. 
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This continuous flow produces a discrete number of flows anywhere in the halo of a galaxy. 
This simple observation has rather interesting consequences. Caustics occur where the flow 
number jumps from n to n + 2, with n being an odd number. Generically, caustics are 
the surfaces in space which separate these regions with differing numbers of flows. The 
number density of CDM particles becomes very large as one approaches the caustic from 
the side with the two extra flows. The following section is devoted to the analysis of the 
CDM caustic surfaces in galactic halos. 
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3 Cold Dark Matter Caustics 



In this section, I make extensive use of the explanations and derivations given in [16]. We 
have seen in the Introduction that, before the onset of galaxy formation, the collisionless 
(weakly interacting) CDM particles lie on a time dependent thin 3D sheet in 6D phase 
space. The thickness of this sheet is the primordial velocity dispersion 8v. At the onset 
of galaxy formation, the particles in a neighborhood of an overdensity fall onto it. As a 
result, the phase space sheet winds up clockwise wherever galaxies grow (Fig. 1). Outside 
overdensities, at a location in physical space, there is only one value of velocity ( i.e., one 
single flow, the Hubble flow). There, the phase space sheet covers physical space once. 
Hence, in Fig. 1, away from the overdensity the plot is one to one. On the other hand, 
inside an overdensity, the phase space sheet covers physical space multiple times, but always 
an odd number of times, implying that there are an odd number of flows at such locations 
in physical space (Figs. 1 and 2). CDM caustics are associated with these flows [16]. We 
examine them rigorously in the next section. 

3.1 Caustics in General 

Let us consider a flow of CDM particles with zero (or negligible) velocity dispersion. To 
study the motion of the particles, we adopt a parametrization of the 3D phase space sheet 
by labelling each particle with an arbitrary 3-component parameter a = (a±, a 2 , a 3 ). The 
phase space sheet location at time t is specified by the map a — * x(a, t) where x is the 
position in physical space of particle a at time t. The velocity of particle a is v = || (a, t). 
Let dj(f,t), with j = 1. . .n, be the solutions of 



Here n is the number of distinct flows at spatial location r and time t. In general, the 
number of solutions n jumps by 2 on certain surfaces which are the locations of caustics. 



f — x(a, t) . 
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The total number of particles is: 
N = 



j3 d3N i -*\ 
dra - — : — — (a) 



dcixdciidoL-i 
d 3 N 



/it, 
da\ 

.7 = 1 



{aj(r,t)) 



i//n2^/n ;5 ' J> | det (§§) | Sj . (f;t) ' 

The density of particles in physical space can be extracted from the above equation: 

r/ ;i .V _ _ „ 1 



(142) 



d(f,t) = J2 



^ dcixdoLidoL-z 



(o,-(f,f)) 



D(d,t) | 5 ( f )t ) 



(143) 



where 



(dx\ 
— j = det(D) , 



(144) 



and X> is the Jacobian. As expected, the density d(r, t) is a reparametrization invariant: 
if we reparametrize the particles a — > (3, such that the n distinct flows (3j(r,t) are the 

— # 

solutions of r — x(/3,t), then: 



daidat2dat3 
dx 



det 



(9a 



det 



det 



(f) 



dpxdp 2 dp 3 



WW) =det (^] det (M 



(145) 



and hence, the density retains its form given in Eq. 143 with a replaced by (5. 

Caustics are wherever the density diverges (i.e. wherever D(a, t) = 0). Thus, the map 
a — > x is singular at the location of the caustics. Generically the zeros of D(a, t) are simple 
(i.e., the matrix || has a single vanishing eigenvalue). The condition that one eigenvalue 
vanishes imposes one constraint on the three parameters a. Therefore, the location of a 
caustic is generically determined by two independent labels, hence it is a 2D surface in 
physical space. 
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3.2 Caustic Surfaces 

Let's consider a generic caustic surface at a fixed time t. Using the reparametrization 
invariance a — > (3(a,t) of the flow, let's parametrize the particles near the caustic such 
that the surface is at @ 3 = 0. Also let's choose a Cartesian coordinate system in a neigh- 
borhood of point P on the caustic surface such that z is the perpendicular direction to 
the surface, whereas x and y are the parallel directions. Hence, in this neighborhood 
dz/dPi = dz/d(3 2 = 0. Thus, we have 

D = # det ( #4-1 , (146) 



d(3 3 ' \d((3 u (3 2 ) / 

around P. The two dimensional matrix d(x, y) / d{(5\ , f3 2 ) is nonsingular, whereas dz/df3 3 = 
at /?3 = 0, hence, by construction D vanishes at (3% = 0, where the caustic surface is. By 
Taylor expanding, we have up to the second order in /3 3 : 

z = z + Bfil . (147) 

We may choose the positive z-direction in such a way that B > 0. Then, since dz/dfis = 
z - z ), the determinant becomes 

D = 2y/B(z - zo) det ( ^'^ ) for z > ^ ■ (148) 

Hence, near a caustic surface located at z = z , the density diverges as ttj= on the side 
of the surface where z > zq. Figure 5. A below depicts a 2D cut of phase space in the (z, z) 
plane. The particles lie on the curve which is at the intersection of the phase space sheet 
with the (z, z) plane. The label /3 3 gives the position of the particles on this curve. The 
caustics, which are located at z — z\ and z = z 2 , extend in the x and y directions. Figure 
5.B depicts the density d(z) which diverges as 1 / yjz — z\ for z — > z\ with z > z±, and as 



1 / — z for z — > z 2 with z < z 2 . There is only one flow for z < Z\ and for z > z 2 , whereas 
there are three distinct flows in the region z\ < z < z 2 . The number of flows changes by 
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Figure 5: A generic surface caustic. A) In phase space. B) In physical space. The two 
dimensions (x and y) into which the caustic extends as a surface are not shown. The 
physical space density d diverges at those locations (zi and z 2 ) where the phase space 
sheet folds back. 



two at the locations of the caustics. The phase space sheet has a fold at z — z\ and z = z 2 , 
where dz/d/3 3 = 0. Hence the velocity space is tangent to the fold. Therefore, in physical 
space the particles pile up at z\ and z 2 where the sheet folds back. More rigourously, if we 
represent the 2D phase space density near a fold in the limit 5v — > by 

d 2 N 



- = N5(z - C(z) 2 ) , 



dzdz 

where C and iV are numbers, then, the density 
d 2 N 



d 



dzdz 



-dz = N I 5(z-C(z) 2 )dz = 



N 



Cz 



A 



(149) 




(150) 
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where A will be called the "fold coefficient." Notice that d diverges as z — > where the 
caustic is located. 

3.3 CDM Infall in Galactic Halos and Caustics 

In this section we will introduce the caustics associated with the infall of CDM particles 
into the potential well of a galactic halo. The existence and structure of the caustics will 
be studied in Sections 3.4.1 and 3.5.1 in detail, using the discussion given here. We set 
the velocity dispersion equal to zero (5v = 0) in our considerations. A small non-vanishing 
velocity dispersion would provide a cutoff, such that the density at the caustic would 
become very large rather than infinite. 

We have already seen in the introduction that the process of galaxy formation involves 
winding up of the phase space sheet of CDM particles which produces a discrete number of 
flows at a location in space. If the galactic center is approached from an arbitrary direction, 
the local number of flows increases. First there is one flow, then there are three flows, then 
five etc. (Fig. 2). The boundary between the region with one (three, five,. . .) and the 
region with three (five, seven,. . .) flows is the location of a caustic which is topologically 
a sphere surrounding the galaxy. As will be shown in Section 3.4.2, when these spheres 
are approached from the side with the two extra flows the density diverges as where 
a is the distance to the surface. We called these spheres "outer caustics." It is a little 
more difficult to see the existence of "inner caustics" in addition to the outer caustics; they 
can not be seen in Fig. 2. Inner caustic rings occur if the CDM particles carry angular 
momentum with respect to the center. In the limit of zero angular momentum a caustic 
ring becomes a caustic point at the center, whereas the outer caustics are unaffected. 

Let us first discuss the case where the overdensity is spherically symmetric and all the 
CDM particles have zero angular momentum. Then, the particles move in and out of the 
galaxy in radial orbits. As a result, the galactic center is a caustic point and the density 
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associated with each flow diverges as ^, where r is the radial coordinate. Caustic point is 
a degenerate case since we assumed spherical symmetry and radial orbits. 

To see the inner caustics, one has to consider the flow of the particles which carry 
angular momentum with respect to the center. In this will be seen in Section 3.4.1, 

outer caustics which surround the galaxy also exist. 

Let us now consider the case where the collisionless CDM particles move in and out 
of the galaxy only under the effect of gravity and rotate due to the angular momentum 
they carry. Figure 6 depicts the successive time frames of a set of CDM particles falling 
through a galaxy. This sequence is inevitable, for a flow in and out of the potential well 
of a galactic halo. In Fig. 6. A, the particles are about to fall onto the galaxy for the 
first time in their history. They make up a closed 2D surface, a topological "sphere," 
surrounding the galaxy in 3D physical space. We call this surface the first "turnaround 
sphere" (in fact, Fig. 6 depicts the intersection of this turnaround sphere with the plane 
of the figure, as it evolves in time). To be more definite, we assume that the particles 
carry net angular momentum about the vertical axis in this plane (i.e., the turnaround 
sphere is spinning about the vertical). The particles near the top (bottom) of the sphere in 
frame A, carry little angular momentum and end up near the bottom (top) of the sphere 
in frame F, after falling through the galaxy. The particles near the equator carry the most 
angular momentum. When the turnaround sphere crosses itself near frame C, they form 
a ring in the spatial section perpendicular to the plane of the figure. The radius of the 
ring decreases in time down to some minimum value, reached near frame D, and then 
increases again during the out flow. Outside this ring, the region inside the sphere which 
has not turned itself inside out makes a doughnut. When this doughnut is pinched out in 
spacetime (in frames C and D), as will be shown later in this section, a caustic ring occurs 
in space. We will designate this ring as the location of the "inner caustic." The radius of 
the sphere at the second turnaround in frame F is smaller than the first turnaround radius 
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A 



B 




Figure 6: Infall of a turnaround sphere. The closed lines are at the intersections of the 
sphere with the plane of the figure at six successive times. The sphere has net angular 
momentum about the vertical axis. It crosses itself between frames B and C. The cusps in 
frames D and E are at the intersection of a caustic ring with the plane of the figure. The 
dot, which represents a particle in the flow, is introduced to keep track of the sequences 
in the flow. The whole infall sphere completes the process of turning inside out just after 
frame E. 

because of the deepening of the potential well of the galaxy due to the infall occurring in 
the meantime. The second turnaround sphere falls back in, repeating the same sequence 
of Fig. 6, although it gets fuzzier due to the gravitational scattering of inhomogeneities in 
the galaxy, and so on. There is another generic surface caustic associated with the mth 
turnaround, where m = 2,3,4, ... . We will call these "outer caustics." At the location 
of the outer caustics, the number of flow changes by two (this is also true for the inner 
caustic rings as well), and particles pile up since the phase space sheet in this location is 
tangent to velocity space (Fig. 2). There is no change in the number of flow near the 
first turnaround (m — 1), hence there is no caustic associated with it. Outer caustics, in 
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fact, are located near where the particles of a given outflow reach their maximum radius 
before falling back in. To illustrate this, let us parametrize the flow at a given time t by 
x(a, (3, to; t), where to is the time the particle was at last turnaround, a and (3 label the 
particles on the turnaround sphere at that time. For example, polar and azimuthal angles, 
which tell us where the particle was on the turnaround sphere, can be used to label the 
particles. The vector, x, is therefore the location of the particle a = (a,j3,t ) at time t in 
physical space. Let us define: x to = , x a = xp = || , and x = ||. Therefore, the 
determinant D(t ,a, (3;t) = det (||) can be written as 

D = xto ■ (x a x xp) . (151) 

The remaining discussion will focus on the flow at a fixed time t (we will discuss on the 
snapshot of the flow at time t). As a result, we do not show the t dependence any further. 
The existence of outer and inner caustics in the presence of angular momentum will be 
proven in Sections 3.4.1 and 3.5.1, respectively. 

3.4 Outer Caustic Spheres 

3.4.1 The Existence of Outer Caustic Spheres 

Let us reconsider now the evolution of the spheres in the frames of Fig. 6.A-F and, after 
frame F. In the beginning, all the later infall spheres, which have greater to, are outside 
the sphere {x(a,(3, t ) : \/a,/3} of frame B, and t increases in the outward direction. 
Because the particles on the sphere are moving inward (hence x(a, (3, to) is pointing inward 
for all (a, (3)), the vectors {x to (a, (3, to) : ~ia,f3} point outward. In frame E the initial 
infall sphere is just about to complete the process of turning itself inside out. Except for 
the particle at the cusp, all the particles on the sphere have already started their out flow. 
After frame E the process is completed. Hence, inside the sphere there are later infall 
spheres and t increases in the inward direction. Thus, even though the particles on the 
sphere in frame F flowing outward (hence x(a,(3,to) is pointing outward for all (a, (3)), 
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the vectors {x to (a, (3, to) : Wa,j3} point inward. Note that x is always in the direction of 
motion and x to points opposite to it, since t decreases in the direction of motion (particles 
in the direction of the motion started to fall earlier, hence they have smaller to). After 
frame F, x to (a, (3, to) will remain inward until the particle x(a,f3,to) reaches the next 
turnaround where it starts to fall back again. After the turnaround the motion is inward 
and x to (a, (3,t ) is outward. Therefore, at some point in the evolution x to (a, (3, to) either 
vanishes or becomes parallel to the sphere {x(a, (3, to) : Va, (3}. At that point D = 0, since 
x a x xp is perpendicular to the sphere. At a given fixed time the collection of all these points 
makes up a closed continuous surface (topological sphere) in space. Or, we can go back in 
time and choose the initial sphere {x(a', (3', t (a', (3')) : Va',/3'} such that at the future 
given fixed time {x to (a', (3', to (a', (3')) : Va', (3'} is parallel to {x(a', (3', to(a', (3')) : Va',/3'}, 
where D — 0. Therefore, the initial sphere {x(a', (3', to(a', (3')) : Va',/3'} will have 
simultaneously vanishing D at the given fixed time, making up a caustic which we call 
an outer caustic. 

To visualize that we really have a closed continuous caustic surface in the flow of 
CDM particles in three dimensional space, recall that the particles that fell in can not 
reach their last turn around radius again. Therefore, at any radial direction f from the 
center of the galactic halo, there are locations where the number of flows jumps from n + 2 
to n, where n is an odd number. As r traces out all the directions in the sky, all these 
locations make up a closed surface (for a fixed n) due to the continuity of the flow. In 
phase space, the 3D phase space sheet folds back at the locations where the surfaces are. 
Hence, at the folds the velocity space is tangent to the phase space sheet and in physical 
space particles pile up at the locations of the corresponding folds. The number density 
diverges (in the limit Sv = 0) at the surfaces because it is the integral of the phase space 
density over velocity space. 
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3.4.2 Density Profiles of Outer Caustic Spheres 

Outer caustics are closed surfaces (topological spheres) near the (n + l)th turn-around 
radii with n — 1, 2, 3. . . . Indeed the number of flows changes by two at the turnarounds 
because of the fall back of the particles. There is no caustic associated with the first 
turn-around (Fig. 2). 

The outer caustics are described by simple "fold" (A 2 ) catastrophes. Their density 
profile is 

d n (a) = ^e(a) (152) 



for small a, where a is the distance to the caustic, O(cr) is the Heaviside function, and A n is 
a constant which we call the "fold coefficient." We choose a > on the side with two extra 
flows (i.e., towards the galactic center). Therefore, when an outer caustic is approached 
from the inside, the density diverges as ~ abruptly falling to zero on the outside. The 
observation [46] of arc-like shells surrounding giant elliptical galaxies can be interpreted 
[47] as outer caustics in the distribution of baryonic matter falling onto those galaxies. 

To estimate A n in Eq. 152, consider the time evolution of CDM particles which are 
falling out of a galactic halo for the nth time and then falling back in. Let R n be the 
turn-around radius. We assume that the rotation curve of the galaxy is flat near r = R n 
with time-independent rotation velocity v TOt . The gravitational potential is then 



\ > -<4>(^) (l- r >3) 



The particles have a trajectory r(t) such that 

dr 



V2 (E-V(r))=v Iot ^2\n (^j , (154) 



dt 

where E is the energy per unit mass. Equation 154 neglects the angular momentum of 
the particles. This is a good approximation at turnaround since the particles are far from 
their distance of closest approach to the galactic center. Finally, we assume that the flow 
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is stationary. In that case the number of particles flowing per unit solid angle and per unit 
time, is independent of t and r, and the caustic is located exactly at the (n + l)th 
turnaround radius R n . 

Let us emphasize that none of the assumptions spherical symmetry, flat rotation 
curve, time independence of the gravitational potential, radial orbits, and stationarity of 
the flow affect the existence of the outer caustics or the fact that they have the density 
profile given by Eq. 152. The assumptions are made only to obtain estimates of the 
coefficients A n . 

The mass density of particles, d n (r), follows from the equality: 2j^dt = ^£p-r 2 dr, 
where m is the mass of each particle. The factor of 2 appears because there are two distinct 
flows, out and in. Using Eq. 154, we obtain the density distribution near the nth outer 
caustic: 



d n {r) = 2- 



dZldt 



(155) 



n r 2 v rot y / 21n(^) 

where = rn-^^. Near the caustic, In (^) = where a = R n — r. Inserting this into 
Eq. 155 and comparing it with Eq. 152, we find 

y/2 dM 



v r ot dVLdt 



K 3/2 ■ (156) 



Estimates of R n and can be extracted from ref. [48, 49] for the case of self-similar 

n 

infall [50, 51]. The infall is called self- similar if it is time-independent after all distances 
are rescaled by the turn-around radius R(t) at time t and all masses are rescaled by the 
mass M(t) interior to R(t). In the case of zero angular momentum and spherical symmetry, 
the infall is self-similar if the initial overdensity profile has the form = {^f^j where 
M and e are parameters [50]. In CDM theories of large scale structure formation, e is 
expected to be in the range 0.2 to 0.35 [48, 49]. In that range, the galactic rotation curves 
predicted by the self-similar infall model are flat 
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The R„ and 



dM 



do not depend sharply upon e. Our estimates are for e = 0.2 



dfldt 

lb 

because they can be most readily obtained from ref. [48, 49] in that case. For e = 0.2, the 
radii of the (n + l)th turnaround spheres are approximately 

{R n : n = 1, 2, . . .} ~ (240, 120, 90, 70, 60, . . .) kpc • ( , ) (^-) . (157) 



Moreover, one has 

dM 



dQdt 

with 



220km/s J V h 



' KV2^ , (158) 



n v rot 4nG 



{F n : n = 1,2,...}- (20, 8, 5, 4, 3,...) 10^ 2 . (159) 
Combining Eqs. 156 - 159, we find 



10" 5 er 

{A»:n = l,2,...}~(2, 2, 2, 3, 3,...) 



cm 2 kpc 1 / 2 

\ 1/2 / , \ 3/2 
^rot \ / h 



(160) 



220km/sy V0.7, 

Generally the outer caustics are "concave" (i.e., they are curved toward the side which has 
two extra flows). Their radius of curvature is of order R n . The lensing by concave caustic 
surfaces is discussed in Section 4.2.2. 

Next, we will discuss inner caustics which, as we will soon see, have the shape of rings 
(closed tubes) and are located near where the particles with the most angular momentum in 
a given inflow reach their distance of closest approach to the galactic center before moving 
back out of the galaxy. 

3.5 Inner Caustic Rings 

3.5.1 The Existence of Inner Caustic Rings 

During each infall-outfall sequence, the sphere of Fig. 6 turns itself inside out. For example, 
a particle (represented by a dot throughout the frames of Fig. 6) which is part of the flow 
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and is just inside the sphere in frame A, appears outside the sphere in frame F. There is, 
therefore, a ring in space-time of points which are inside the sphere last. The intersection 
of this space-time ring with the plane of the figure is at two space-time points, one located 
at the cusp in frame D, the other at the cusp in frame E. Although the cusp in frame 
D is smoothed out in frame E, and the cusp in frame E does not exist in frame D, since 
there is a continuous flow of spheres falling in and out that are continuous deformations 
of one another, the cusp-ring just defined is a persistent feature in space. Here, we should 
note that it is possible mathematically to turn a sphere inside out by means of continuous 
deformations which allow the surface to pass through itself without puncturing, ripping, 
creasing, pinching, or introducing any sharp points in to the intermediate stages. This 
process is called the "sphere eversion" [52, 53, 54] in topology. In our problem, however, 
the particles of the sphere are constrained to move purely under the effect of gravity and 
the angular momentum they carry. When a sphere is pushed bottom up and top down 
through itself as in Fig. 6.B, a ring shaped fold occurs where the sphere intersects itself as 
in Fig. 6.C. This ring-fold can not be eliminated without creating a sharp crease (the cusps 
of Fig. 6.D and Fig. 6.E) in the physical process we are considering. In the mathematical- 
sphere-eversion process, the ring is pulled down on the sphere becoming smaller and then 
the ring is squeezed so that its two sides touch and merge. 

We, now want to show that the cusp-ring, as defined early in the previous paragraph, 
is a caustic location (cusps in Fig. 6.D-E are at the intersections of a cusp-ring with the 
plane of the figure). Consider Fig. 7 that zooms in to the neighborhood of space and time 
where the infall sphere is completing the process of turning itself inside out. Coordinates x 
and z are chosen as shown in the figure. The y direction is into the plane of the figure and 
parallel to the ring at point E. We parametrize the flow in a small neighborhood of the 
ring by a = (a,(3,t ) such that || = || = 0. As before, t labels successive infall spheres 
and may be taken to be the time of their last turnaround. Thus: 




(161) 
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Figure 7: An infall sphere near the location where and when it completes the process of 
turning itself inside out. The curves are the intersections of the sphere with the plane of 
the figure at two successive times, corresponding to frames C and D in Fig. 6. Parameter 
a labels points along the line. |^ = at points A and B. |^ = at point C. Points A, B 
and C move to point E, which thus is the location of a caustic since D = there. 

In the top figure of Fig. 7, the infall sphere is about to reach to the ring. Parameter a 
labels points (particles) along the curve. At points A and B, |^ = because the tangent 
vectors to the curve at these points are parallel to the ^-direction. At point C, on the 
other hand, || = because the tangent at C is parallel to the indirection. In the bottom 
figure of Fig. 7, the infall sphere reaches the ring. Points A, B and C of the top figure 
have moved to point E in the bottom figure. Hence |f = = and, therefore, from Eq. 
161, we have D = at point E. Thus, E is a caustic location. Since, in general, D only 
has a simple zero at point E, E is in fact a location of a surface caustic. In the discussion 
that follows, we will show that the complete inner caustic surface is a closed tube and E 
is a point on this tube. 

To see the tube topology, consider Fig. 6.B. The vector x to = (not shown explic- 
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itly) points outward everywhere on the infall sphere since the later (earlier) infall spheres, 
which have greater (smaller) t , are outside (inside) of this one. In other words, t increases 
in the outward direction. In Fig. 6.F, however, where the sphere completes the process 
of turning itself inside out, all the later (earlier) infall spheres are inside (outside) of this 
one. Therefore, x to points inside everywhere on this sphere. This implies that during 
the infall-outfall, x to either vanishes or becomes parallel to the sphere at some space-time 
points. D = at such points, hence they are the location of caustics. Since D is a con- 
tinuous function of a = (a,P,t ), the caustic must lie on a closed surface (vanishing of 
D constrains the three parameters leaving only two independent ones, hence we have a 
surface). Now, if we follow the motion of points which are near the North (South) pole of 
the sphere in Fig. 6.B and, end up near the South (North) pole in Fig. 6.E, we see that 
x to always points up (down). It does not reverse its direction. Hence, x to does not vanish 
and is not parallel to the sphere at any time between these two frames for these points. 
Therefore, within a cylindrical region extending from North to South in the spatial volume 
under consideration D does not vanish. This implies that inner caustics do not exist in this 
cylindrical region. On the other hand, for the points near the equator in Fig. 6, x to points 
outward during infall and points inward during outfall. For example, at the East (West) 
pole in Fig. 6, x to points outward from frame A to the the end of frame D (frame E) and 
points inward in frames E and F (in frame E). Thus if we track a point near the equator, 
at some time x to either vanishes or is parallel to the sphere, hence D vanishes. The points 
where this happens lie on a closed surface which wraps outside of the previously defined 
cylinder. Therefore, the surface is a tube located near the equator, where the particles 
with the most angular momentum are at their distance of closest approach. 

According to the Brouwer's Hairy Ball (or Fixed Point) Theorem, an everywhere 
nonzero tangent vector field on the 2-sphere does not exist. Any continuous tangent vector 
field on the sphere must have at least one point where the vector is zero. Finding examples 
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is easy. Since the sphere with one point removed is homeomorphic to the plane, using the 
fact that a constant (and thus nonzero) vector field on the plane exists, we can obtain a 
vector field on the once-punctured sphere. Or, consider a line L tangent to the unit sphere 
S in R 3 at the point N. For every plane P containing L, consider the intersection of P with 
S. These create a family of circles C on S all separated from each other, except that they 
all contain the point N. Now define a vector field on S at each point x, using the tangent 
direction of the unique circle C through x, with a magnitude equal to the distance from 
the point N. Hence, the zero vector is at N. Another way of stating the theorem is that 
given a ball with hairs all over it, it is impossible to comb the hairs continuously and have 
all the hairs lay flat so that they change direction smoothly over the whole surface. At 
least one hair must be sticking straight up. This implies, as a corollary, that somewhere 
on the surface of the Earth, at any moment there is at least one point with zero horizontal 
wind velocity. 

Consider now the angular momentum distribution on the turnaround sphere. This 
is a special continuous two-dimensional vector field on the sphere which has two zeros at 
the poles where the net angular momentum axis crosses the sphere. When we tracked 
the particles in the neighborhood of these angular momentum zeros, we found that their 
Xt does not vanish and is not parallel to the sphere at any time during the infall-outfall 
process. Therefore the inner caustics appear only in the flow of particles which are some 
distance away from both zeros. Since that set of particles has the topology of a closed 
ribbon (I x S* 1 where J is a closed interval in R), and the previously defined caustic ring 
(the set of points E which are in the turnaround sphere last) goes around this closed ribbon 
once, the inner caustic must be a closed surface with one handle (i.e., a closed tube). 

Figure 8 depicts the snapshot of the flow near the caustic. The curves are where 
the simultaneous (same t) infall spheres corresponding to five different initial times: i i > 
to2 > ■ ■ ■ > to5 intersect the plane of the figure near the caustic. The five numbered dots 
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Figure 8: Qualitative description of the flow near a caustic ring. The solid lines are at 
the intersection of five simultaneous infall spheres corresponding to different initial times 
^05 < ^04 < • • • < £01 ■ The five numbered points are at x(a,tok), k = 1 , . . . 5, for some 
value of a. Points E and F are defined in the text. The closed dashed line is at the 
intersection of the caustic tube with the plane of the figure. There are four flows inside the 
dashed line whereas outside there are two. The galactic center is to the left of the figure. 

show the positions x(a,t k), k = 1, ... 5, for a fixed a and five different initial times and 
following the dots gives one a qualitative picture of the flow in time. The £ 01 sphere is 
falling in but has not yet crossed itself, as in frame Fig. 6.B. The £02 sphere has crossed 
itself but has not yet completed the process of turning itself inside out, as in frame Fig. 
6.C. The to3 sphere is just completing the process of turning itself inside out, as in frame 
Fig. 6.D. The cusp at point E is the location of a caustic for the reason given earlier. Let 
(Xe be the value of a at E and hence, x((Xe, £03) be the position of E in space. The particles 
at the cusp are moving to the left. Thus at (oe,£ 3), % is pointing to the left, whereas x to 
is pointing to the right of the £03 infall sphere. Although time £ increases in the direction 
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of motion, initial time to decreases (earlier infall spheres are older, hence t increases in the 
direction of motion, or equivalently they have smaller birthday to). Therefore ^- ~ — J^. 

For smaller initial times, such as t 05 , the spheres move to the right; hence x points 
to the right of the infall sphere, whereas xt points to the left. Let to4 be the initial time 
and F be the point where x to (aE,t ) crosses the t 4 sphere. Then, according to Eq. 151, 
D = x to ■ (x a x xp) vanishes at F because x to is on the sphere and (x a x xp) is perpendicular 
to the sphere at F. Thus F is the location of a caustic as well. Now, let us consider any 
other point which is far from both E and F . At such a point there are two flows because 
the sphere passes such a point twice, once on the way in and once on the way out. Consider 
also a point between E and F. At such a point there are four flows because the sphere 
passes by four times. We can count these flows on Fig. 8 at fixed t as follows: twice for 
initial time t between t 02 and t 3 (down and up flows), once between t 03 and t 4 ( m flow), 
and once between t 4 and to5 (outflow). Therefore, there is a finite compact region whose 
boundary in the plane of Fig. 8 is shown by the dashed lines. Inside the region there are 
four flows (down, up, in and out), while outside there are two (in the left of the region, 
down and up; in the right, in and out). The boundary of this region is the location of the 
tube caustic. If the successive spheres all fall in exactly the same way, the caustic ring 
is stationary. In general, however, the trajectories of successive spheres change slowly in 
time t, due to the growth of the halo during the infall and inhomogeneities. As a result 
the caustic ring moves about. 

3.5.2 Axially Symmetric Caustic Ring 

In this section we assume that the dark matter flow in and out of the galaxy is axially 
(rotationally) symmetric about z and reflection symmetric under z — > —z. We use the 
following parametrization of the flow. Let R(to) be the turnaround radius in the z = 
plane at time t . Then let x(9 Q ,(p Q ,t ;t) be the position at time t of the particle that 
was at the location of polar and azimuthal angles (#o, <^ ) on the sphere of radius R(t ) at 
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time t - With this choice, we keep the size of the sphere that we use to label the particles 
comparable to the size of the growing galaxy. The density is given by Eqs. 143 and 144 
with a = (9 ,(fo,t ). There are n flows outside the closed tube, whereas there are n + 2 
flows inside, as described in a general fashion in the previous section. 

Although the flow is in 3D space, in general, the dimension along the tube is irrelevant 
as far as the caustic properties are concerned. Particles are actually moving in this direc- 
tion, however the motion is a simple rotation along the azimuthal direction. To discover 
the essential properties of the caustic tubes, it is sufficient to consider the cross-section 
perpendicular to the trivial direction. Thus, the flow is in effect two dimensional, even 
in the absence of axial symmetry. The essential properties of caustics are invariant under 
continuous deformations. To proceed further, we will assume axial symmetry of the flow in 
this section. Therefore, let's throw away the irrelevant ip coordinate and choose p(a,to',t) 
and z(a,t ;t) as the cylindrical coordinates at time t of the ring of particles which start 
to fall-in with the polar angle 9 = | — a at initial time t . We then can write the number 
of particles 

/f d 3 N f d 2 N 

pdpdz2nd(p,z,t) = dadt d(p- — ; — —(a,t ,(f>)— dadt - — — (a,t ) . (162) 
J dadt d(p J dadt 

In the last equation, changing variables (a, to) — > (p,z), we obtain 

n( ^ t} d*N , , 1 



f , , d N t 

J p 22 d^r ^> 

.7 = 1 



where the determinant of the Jacobian 



D 2 (a,t )\ 



(163) 



(a, to)=(a, t )j(p, z, t) 



D 2 (a,t ) = det [ §^ ) , (161) 



,<9(<Mo), 

and (a, t )j, with j — 1. . .n, are the solutions of p = p(a, to; t) , z — z(a, to] t), and n is a 
function of p, z, and t. Comparing Eqs. 162 and 164, we find 

1 »<g«> d 2 N _ 1 

3=1 
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d (P> z ^ = W P ? d^dTo {aM | D 2 (a,t ) 



(165) 

(a,t )=(a,t )j(p, z, t) 



Under a reparametrization of the flow (a, t ) — > [o/(a, t ), t' (a, to)], the determinant trans- 
forms according to 

D 2 (a,t ) = D' 2 (a',t' ) det (^4\) ■ ( 166 ) 
In particular, for an a-dependent time shift: 

a' = a, t' = t + At (a) , (167) 



the Jacobian of the transformation det 




= 1, and 



D 2 (a,t ) = D' 2 (a',t' ) . (168) 

Therefore, reparametrizations of type Eq. 167 leaves the determinant D 2 unchanged. 
3.5.3 Flow near an Axially Symmetric Caustic Ring 

In section 3.3 we qualitatively described the CDM flow near a ring caustic, as summarized 
in Fig. 8. In this section, we give a quantitative description of ring caustics in the case 
of axial and reflection symmetry. Reflection symmetry is automatic if the caustic is tight, 
namely if the transverse dimensions p and q of the caustic are small compared to the tube 
radius. We will assume that the caustics are tight. 

Recall that the flow is reparametrization invariant. In Fig. 8 we describe the flow at 
given time t. The time of observation is fixed. However, we use initial time to to label the 
particles. Now, let us introduce a Cartesian coordinate system on Fig. 8 by choosing the 
point E as the origin, the horizontal axis as the x-axis, and the vertical axis as the z-axis. 
In the reflection symmetric case, points E and F have z — 0. Let us label the particles 
on the z = axis by (a, r — 0) and relabel the flow by shifting the initial time to in an a 
dependent way: a — > a, t — > r = t + At (a) such that z(a, r = 0) = for all a. Physically 
this means that we relabel the particle going through the cusp at E by (a, r) = (0, 0) and 
all the other particles on the negative x-axis (z — 0), by (a, r = 0). Due to the reflection 
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Figure 9: Qualitative descriptions of the reflection symmetric flow near a caustic ring. A) 
Same as Fig. 8 except that now the flow has reflection symmetry and we introduce a 
coordinate system whose origin is at the cusp E. a — at E. The x-coordinates of the 
particles labelled by ±ct are the same whereas their ^-coordinates have the opposite sign. 
B) We reparametrize the flow by shifting t i n an a dependent way: to ~~ T — to + At (a) 
such that z(a, r = 0) = for all a. We label the particle at the cusp by (a, r) = (0, 0) and 
all the other particles on the negative x-axis by (a, r = 0). Due to the reflection symmetry, 
we have (z,x) — > (—z,x) under (a, r) — > (— a, r). This fixes the labelling for the rest of 
the particles in the flow. 

symmetry, we must have (z,x) — > (—z,x) under (a,r) — > (— a, r). We are interested in 
the caustics in the flow. The properties of a caustic depend on the flow near the caustic 
only. It does not matter what happens in the flow away from the caustic. Therefore, we 
want to expand the flow (x and z) near the caustic in powers of the parameters a and r 
keeping terms up to second order only. Let's start with the shell of particles that is going 
through the cusp. This particular shell is in fact a line segment on the x-z plane where 
z = and x < 0. Recall that the parameter r is chosen as zero for this line. Hence, the 
parametrization satisfying the above symmetry requirements is 

z = x = -^sa 2 , (169) 

for this line of particles of the flow, where s is an arbitrary constant. Now, lets expand 
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the flow around the cusp where a = r = by writing the most general second order 
polynomials: 

z = C\ a + C2 r + C3 a 2 + c 4 ar + C5 r 2 , 

x = d\ a + d 2 r + d 3 a 2 + d 4 ar + d 5 t 2 . (170) 

Due to reflection symmetry, z must be odd whereas x must be even under (a, r) — > (—a, r). 
We then have c 2 = C3 = C5 = in z and d\ = d 4 = in x. Applying the condition that at 
t = 0, z = and x = — ^sa 2 , we identify C\ — and d 3 = — |s. Renaming the constants 
as c 4 = b, d 2 = — utq, d 5 = \u where u and r are constants, we have 

z = bar , (171) 
x = l -u[{r-r ) 2 -rl]- l -sa 2 . (172) 

If we call po the p-coordinate of point E where a = r = 0, we have x = p — po- Equation 
172 can then be written as 

p = a + ^w(r - r ) 2 - isa 2 , (173) 

where we introduce a new constant 

a = po - ^utq = po - p , (174) 

which will turn out to be the inner tube radius (p-coordinate of point F where a = and 
t = t ) and p = ^mt, 2 , the longitudinal dimension of the tube. The parameters b, u and s 
are positive. 

The parameter b is positive because the flow is from top to bottom, ^ < 0, (bottom 
to top, ^ > 0) for particles with a > (a < 0). Remember that particles closer to the 
z = plane reached the turn around sphere earlier; therefore, they have smaller t , and 
hence smaller r. Although time t increases in the direction of motion, r decreases. Hence, 
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for particles with a > (a < 0), *g ~ -f > § < 0). Since § = 6a, the 

constant b > 0. Thus, 6a is the negative of the velocity in the z direction of the particle 
coming in at an angle a. For particles with a = the velocity in the z direction is zero, 
because they move in the z = plane. 

The parameter u is positive because the particles with a = are accelerated outward 
by the angular momentum barrier, hence p = a+\u{r— r ) 2 > a. Thus, u is the acceleration 
of the particles away from the distance of closest approach. The parameter s is positive 
because at r = 0, the particles with a^O are at p = p — |sa 2 < p . The parameter r , 
which is the time it takes a particle to go from the cusp to the radius of closest approach, 
can either be positive or negative. Since p = \ur^ > 0, a = p — p is always smaller than 
Po (i.e., point F is always closer to the galactic center than point E, as noted earlier). 

The determinant of the Jacobian D 2 (a,r) = detV(a,r) where 

—sa br 

V(a,r)= | | = ( | , (175) 

u(t — t ) ba 
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and vanishes for 



D 2 (a, t) = -b[ur(r - r ) + sa 2 } , (176) 



a = ±^r(r -r) , (177) 

with < t < t if t > 0, and r < r < if r < 0. Since the CDM density diverges where 
D 2 vanishes, by substituting Eq. 177 into Eqs. 171 and 173, we find parametric (r = 
parameter) equations describing the cross-section of the caustic surface in the (p, z)-plane: 

p = a+^u{r-r )(2r-r ) , (178) 



: = ±^/V(to-t) . (179) 
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Figure 3 shows a plot of the cross-section. Both p(r) and z{r) are extremized for 
r = |r , where p = a — |p and z = ±^p-^=p. At r = 0, p = a + p and 2 = 0. At r = r , 
p = and 2 = 0. Thus, the dimensions of the cross section in the p and z directions are 



P = \ UT Q and 1 = — j— "7= P > ( 18 °) 
2 4 vms 



respectively. The caustic has three cusps: one at r = where (p, z) = (a + p, 0); and two 
at t = I r where (p, z) = (a — |p, ±^g). 

The three cusps are related by Z 3 transformations. To show this, let us rescale the 
parameters: 

T = -, A=Jl-a, (181) 

T V U T 

and rescale and shift (in the p direction) the coordinates: 

„ 2 /si 2 , ,1 

— 2 2 , A ^ 

Eqs. 171 and 173 then become 



tV.^' • Y =ri ( "- a) -4- (182) 



Z = 2AT , X = (T - l) 2 - A 2 - - . (183) 

These relations are invariant under the discrete transformation: 

Z' = -X sin(-27r/3) + Z cos (-2tt/3) = --Z + — X , 

X' = Xcos(-2tt/3) + Zcos(-2tt/3) = ~^yZ - ^X , 

T> = - 1 -T + y^A+ 3 - , A' = -y^T- 1 -A+^. (184) 
2 2 4' 2 2 4 v; 

Notice that the cube of this transformation is the identity: Z'" = Z, X'" = X, A'" = A, 
and T'" = T. In the X-Z plane, the transformation is a rotation by 120°. It transforms 
the three cusps of the caustic into one another. Thus, after the rescaling, the caustic has 
a Z 3 symmetry. Hereafter, we will call the shape of Fig. 3 "tricusp." In the language of 
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Catastrophe Theory, the tricusp is a LL 4 catastrophe. The apparent reason for the two 
cusps which are not on the p-axis in Fig. 3 is that both 

g = ^(4r-3r ), (185) 

and 

dr 2V s y/f=T 1 ' 

vanish for r = |r . The simultaneous vanishing of jfr and ^ is, however, not an accident 
peculiar to our assumptions of symmetry and/or our limiting the expansion of p and z to 
terms of second order in a and r. This claim is easy to prove. Notice that the equation 
for the location of a generic caustic in 2 dimensions: 

n i \ d P dz d P dz n <iz>7\ 

defines a(r) such that [p{r) = p(cx(t),t), z(t) = z(cx(t),t)] is a parametric representation 
of the caustic location. Wherever p{r) has an extremum, z{r) is expected to have an 
extremum as well, since 

dp = df^da + dp =Q (lgg) 
dr da dr dr 



implies 



And Eq. (187) yields 



which implies 



dp 
da 



dz dz da 
dr da dr 



, (190) 



dz dz da dz (191) 
dr da dr dr 



if ^ and ^ are finite. In the same way, it can be shown that the vanishing of ^ implies 
that = 0. The cusp at point E may appear to have a different origin. Its apparent 

3 

reason is that z ~ ±T2 near r = which is at the boundary of the range of r. This, 
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however, is an artifact of the parametrization we have used. Indeed, we saw that the three 
cusps can be transformed to each other due to a Z 3 symmetry, hence the cusp on the p-axis 
can be given the same parametrization as the other two. In conclusion, the appearance 
of three cusps in the cross-section of a ring caustic is neither an accidental consequence of 
the assumed axial and reflection symmetries nor is it due to the expansion including only 
second order terms in Eq. 170. 

Inside (outside) the tricusp, there are four (two) flows. To count the distinct flows, 
let us restrict ourselves to the z = plane, where either a = or r = 0. If a = 0, then 
p = a + \u{t — T ) 2 > a. If r = 0, then p = a + \uTq — ^sa 2 < po- Thus, for z = and 
p > a, with a — 0, there are two flows parametrized by 



(n. T+ )= | 0. 7-,, J^(p-a) j , (a,r_)= lo,r -J^(p-a) j (.1.92) 



Both of the flows have 



whereas 



|> = 0) = -^(a = 0) = , (193) 



^(a = 0,7+) = -^(a = 0,t+) = -«(r+ - r ) = -^2u(p-a) , (194) 

^(a = 0, r_) = = 0, r_) = -u(r_ - r ) = + v / 2 M (p-a) . (195) 

Thus, we call the flow (a,r+), for which ^ < 0, the "in" flow and the flow (a, r_), for 
which || > o, the "out" flow. For z — 0, (a^0,r = 0) and hence p = p — |sa 2 < Po, 
there are two other flows parametrized by 



(>' + ■-)=( \/^(Po-p), 0] , (a_,r)= (-y^(po-p), 0j . (19G) 



for both of which 



^(r = 0) = -^(r = 0) = M r , (197) 
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whereas 



dz 
di 
dz 
di 



(a_,r = 0) 



(a+,T = 0) 



dz 
ih 
dz 



(a-,T = 0) = -ba- = +b\-(po-p). (198) 



(a+,r = 0) = -ba + = -b\ -(p - p) , 




Thus, we call the flow (a + ,T = 0), for which || < as the "down" flow and the flow 
(ck_,t = 0), for which || > as the "up" flow. Therefore, in the z = plane, there are 
four flows (in, out, down and up) for a < p < p , whereas there are two flows (down and 
up) for p < a, and two flows (in and out) for p > p . 

Away from the cusps, the caustic is a generic surface caustic, as described in Section 
3.2. Thus, if one approaches the boundary of the tricusp from the inside and away from 
any of the cusps, as will be shown for a sample point below, the density diverges as d ~ ^= 
where a is the distance to the boundary. If the boundary is approached from the outside, 
away from any of the cusps, then the density remains finite until the boundary is reached. 
To illustrate this, let us again restrict ourselves to the z = plane and calculate D 2 as 
p — a — > + and p — a — > 0_. In the former case there are four flows (up, down, in and 
out), while in the latter there are only two (up and down). The caustic is caused by the 
appearance of the extra in and out flows in the region p > a. For z — 0, and p — a — > + , the 



in and out flows have (a, r) = (0, r ± y \ (p — a)), respectively. Therefore the determinant 
(176) is 



and up) yield finite contributions to the density. At z = 0, for the down and up flows 




D 2 = Tbr^2u(p - a) . 



(199) 



Since r — > t as p — a — > 0+ when a = 0, D 2 — =F&r A/2-u(p — a) for the in and out 
flows, the density associated with these flows diverges as }_ . The two other flows (down 



(o:±,t) = (±y|(po — p) )0). Hence, the determinant 




D 2 = -bsa 2 ± = -2b(p - p) = -2b(a + p - p) . 



(200) 
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Therefore, in the limits p — a — > + , (p > a), and p — a — > 0_, (p < a), |D 2 | — ■> 25p for the 
down and up flows. The contribution of each flow to the density, in both regions, is 

Near the cusps, the behavior depends upon the direction of approach. For z = and 
P ~ Po ~^ 0- (inside the caustic tube), there are four flows: in, out, up and down. For 
p— Po - ► 0+ (outside the caustic tube), there are two flows: in and out. In the region p < p , 
in addition to the extra down and up flows, which both diverge in the limit p — p — > 0_, 
one of the common flows (out flow if r > 0, in flow if r < 0) of the regions also diverges 
in both limits p — p — > 0±, whereas the other (in flow if r > 0, out flow if r < 0) remains 
finite. Notice that near the cusp (unlike the case p — a < where both of the down and up 
flows remain finite as one approaches the caustic from the outside), there is a flow which 
diverges as one approaches the caustic from the outside of the tube. Hence, the cusps are 
more divergent then the folds. 

Let us first calculate the density due to the down and up flows which exist only in 
the region p < p , for z = and p — p — > 0_. Recall (Eq. 196) that in the z = plane, 
for the up and down flows, r = and a 2 = - (p — p ). Then, D 2 = —bsa 2 = 2b(p — p), 
and hence d ~ — !— . Let us now calculate the density due to the in and out flows which 



exist for both p < p and p > p . For p — p — > 0_, let us write p = p — a where a — * + . 
Then, Eq. 192 gives for the in and out flows 




(201) 



respectively. Thus, using Eq. 176, we find 




(202) 



If tq > the in flow gives, to first order in a, 



-bu 2r n 2 



a 



(203) 



u 
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As a — > 0, D 2 — > — 26-uTq = — 4op. Hence the density of the in flow d ~ ^ is finite for 
t > 0. The out flow, however, has a divergent density for r > 0. Since D 2 ~ 6er = 6(p — p), 
the density is d ~ If r < 0, the flows are time reversed: the out flow becomes the 

in flow and vice versa. Indeed, Eq. 202 gives D 2 — bcr, hence d ~ ~^~ p f° r the ^ n fl° w ) 
and D 2 — — 6m[2tq — |cr], which yields d ~ ^ for the out flow. For the region p > p , 
in the limit p — p — > + , p = p + a where a — > + . Then, for the in and out flows 



(a,r±) = (0,7b ± yr 2 + fa ) and D 2 = -bu[±T \r \^l + f + r 2 + §<r]. Therefore, the 
in and out flows have the same density as for the in and out flows where p — po ~^ 0_. If 
t > 0, the in flow is finite: d ~ ^ and the out flow is divergent: d ~ ^ r o < 0) 

flows are time reversed, hence the in flow has d ~ — and the out flow has d ~ 4-. 

Finally, let us calculate the density near the cusp at p = po an d as z — > 0±. Since 
p = Po — ut t + |r 2 — |ct 2 , at p = po where (r, a) = (0, 0), 

-«t t + |r 2 - ^a 2 = . (204) 

Near the cusp, where (p,z) = (po,0±), (r, a) ^ (0,0) and a and r are independent since 
the point of interest is not exactly on the caustic surface. The determinant D 2 = —b[ur 2 — 
ut t + so 2 } can be written at p = p , using Eq. 204 as 

D 2 = -bu [2r 2 - 3r r] . (205) 

Since - = br, D 2 at p can also be written as 



a 

v2 



D 2 = -u 



2z 2 z 
b a z a 



(206) 



Since near the cusp - = br — > 0, (-) 2 can be neglected next to - in the above equation. 
This is equivalent to neglecting the ur 2 term next to ut in Eq. 204. Therefore, from Eq. 
204 and Eq. 206 we obtain 

1/3 



2 U U Z 

a ~ — 2— r r = — 2— r — =>- a ~ 
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u 

- 2 Vs T ° Z . 



(207) 



and 



D 2 ~ 3ut - 

a 



u 2 brQs\z\ 2 



, (208) 

for one of the flows. Hence d ~ for this flow. Thus, we find that, the density also 
diverges in this particular limit near the cusp. 

The tube caustic collapses to a line caustic in the limit r — > with -j= fixed. In this 

limit 



1 , V27 b . . 
p= ur 2_ 0> g = ^L — p^O, (209) 

2 4 vms 



D 2 -»• -b(ur 2 + sa 2 ) , p -> a + J(«r 2 - sa 2 ) . (210) 
Therefore, we can rewrite 

D 2 = -2yfb 2 [{p -a) + sa 2 ] 2 = -2^-b 2 [{p - a) 2 + s 2 a 4 + sa 2 (ur 2 - sa 2 )] , (211) 

where we replaced p — a by \{ut 2 — sa 2 ) using Eq. 211. Since z = bar, we might write 

D 2 = -2^b 2 (p-a) 2 + usz 2 . (212) 

Hence the density 

„ . 1 d 2 N 1 , . 

d p, « = 7T- -T-7T -===== , (213) 
2vrp rfadto V& 2 (p - a) 2 + wsz 2 

in the r — > limit. When p and g are finite, but much smaller than a, Eq. 213 is 
approximately valid for p, q <^ a <^ a since the terms of order r in Eqs. 171, 173 and 176 
are negligible in this regime. 

3.5.4 Differential Geometry of an Axially Symmetric Caustic Ring 

In this section, the differential geometric properties of the axially symmetric caustic ring 
are studied. At the end, we calculate the principal curvature radii, which are needed for 
lensing applications, at an arbitrary point on the ring surface. Any surface in R 3 is uniquely 
determined by certain local invariant quantities called first and second fundamental forms. 
Following [55], let us derive the fundamental forms for the ring surfaces. 
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Let X = X(t, 0) be a coordinate patch on a surface. The differential of the map 
X — X(t, 0) at (r, 0) is a one to one linear map 

dX = X T dr + X^d(f) , (214) 

with X T = ^ , X<p = of the vectors (dr, <i0), which extend from (r, 0) to (r+dr, 0+<i0) 

— * — * 

in the r0-plane, on to the vectors X T dr + X^dcf) parallel to the tangent plane of the surface 
at X(t, 0). Notice that, we are using the symbols dr and d<f) both for the differentials of the 
coordinate functions in the r0-plane and for the components of a vector in the r0-plane. 
In the same way, we denote X T dr + X</>d(f) by dX. Recall that, since 

X(t + dr, + dxf>) = X(t, (j)) + dX + 0(dr 2 , drdcp, d<p 2 ) , (215) 

the vector dX is a first order approximation to the vector X(r + dr, + c?0) —X(t, 0) from 
the point X(t, 0) on the patch, to the neighboring point X{t + dr,<p + d(p). Consider the 
quantity: 

I = ds 2 = (X T dr + X^dcj)) ■ (X T dr + X^dcj)) 

= (X T ■ X T )d 2 r + 2{X T ■ X^drdcf) + (X • X^d 2 (\> 

= E d 2 r + 2F drd(fi + G d 2 <\> . (216) 

The function / is called the first fundamental form of X — X(t, 0). The first fundamental 
coefficients E, F and G are functions of r and 0. Let us find the first fundamental form 
(i.e., the metric), on the caustic surface. The three vector 

X(t, 0) = p(r) cos (0)£ + p(r) sin (<j>)y + z{t)z (217) 

completely describes the surface. The functions p(r) and z(t) are given in Eq. 179, which 
we copy here 



I u 
p = a + -u(t - r )(2r - r ) , z = ±bJ -r 3 (r - r) . (218) 
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Since 



X T = p cos(4>)x + p' sm((j))y + z z , = —p sin(0)£ + p cos((f>)y , 



(219) 



where prime denotes the first fundamental form coefficients for the caustic ring are 



E = X T ■ X T = p' 2 + z' 2 , F = X T ■ X<p = , G = X <t> -X lf> = p 2 



(220) 



Therefore, 



ds 2 = (p' 2 + z' 2 )dr 2 + p 2 d^ . 



(221) 



Using Eq. 179 the explicit metric of the surface is found 



ds 2 



u(3t - At) 2 [u + 



b 2 T 



-]dr 2 + [2a + u(t - r)(r - 2r)] 2 d0 2 



s(r - r 



(222) 



The area A of the caustic surface can be obtained readily since 



A=2 d(f) dr v/det g (223) 
Jo Jo 

where the determinant of the metric is EG — F 2 > 0. Because / is positive definite, its 
coefficients must satisfy E > 0, G > 0, and EG — F 2 > 0. This can easily be verified. 

— * — * — » — » 

Since X T and X^ are independent, X T ^ 0, 0, and hence E = X T ■ X T = \X T \ > 0, 

G = X* • Xs = \XJ 2 > 0. The determinant 



det (g) = EG - F 2 = (X T ■ X T ) (X^ ■ X+) - (X T ■ X+) (X T ■ X+) 

= (X T x X^) ■ (X T x X^) = \X T x X^\ 2 >0 . (224) 



On the caustic surface we have 

detfa) = (p' 2 + z' 2 )p 2 = -^(3r - 4r)> 4 



'2 .'2\,;2 U /o_ A-\2/„. , b 2 T \ |~o„ i „.lJ2 Q_ _ i o_2\l 2 



16 



s(r - t) ' 



[2a + u(t 2 - 3r r + 2r 2 )]' 



For simplicity, if we choose |f = 1, we find the area as 



A 



8vr 
105 



p (35a + 29p) . 



(225) 



(226) 
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The non-vanishing, independent components of the Riemann and Ricci tensors of the 
caustic ring surface are 

pz\p'z" -z'p") ub 2 t (3to - 4r)(2a + u(r 2 - 3r r + 2r 2 )) 

T ^ (p' 2 + z' 2 ) 8 (r -r)(b 2 r + us(r -r)) ' 1 ' j 

z'(j/z"-zy) ub 2 r (3r -4r) 

-ZXt 



p(p' 2 + z' 2 ) 2 (r - r)(b 2 r + MS (r - r))(2a + w(r 2 - 3r r + 2r 2 )) ' 

(228 

_ pgV^ - g fe 2 ro(2a + M (r 2 -3r r + 2r 2 )) 

^ (p' 2 + z' 2 ) 2 " 2 (3r -4r)(6 2 r + MS (r -r)) 2 ' ' ' 

respectively. Components of the Ricci tensor can also be written as 



z'jp'z" - z'p") 

R - = 9 * p(p' 2 + z> 2 ) 2 ■ (230) 



Therefore, the Ricci scalar is 



Z'(p'z"-Z'p") _ -±sb 2 T 

p(p' 2 + z> 2 ) 2 (3r - 4r)(6 2 r + us(r - r)) 2 (2a + u(r 2 - 3r r + 2r 2 )) ' 1 ' 

The Gaussian curvature of the surface is calculated as 

_ R T<t>T4> _ z'jp'z" - z'p") _ 1 
det(^) " p(p' 2 + ^' 2 ) 2 ~2^' 

and will also be obtained as the product of the principal curvatures later in this section. 

In order to calculate the principal curvature radii and the Gauss curvature of the 
surface, we need to calculate the second fundamental form 77. To find //, we need the 
unit normal vector iV = , J -/, , which is a function of r and 6, with differential dN = 
N T dr + N^dcf). Observe that = d(l) = d(N ■ N) = 2 dN • N. Thus dN is parallel to the 

— * 

tangent plane at X. The quantity 

// = -dX-dN = -(X T dr + X^dcj)) ■ (N T dr + N^dcj)) 

= -X T ■ N T d 2 r - (X T -N^ + X^- N T )drd(f) - X^ ■ N^d 2 (f) 
= L d 2 r + 2M drdcj) + N d 2 <p , (233) 
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is called the second fundamental form of X = X(t, 0). L, M and N are called the second 
fundamental coefficients, and are continuous functions of r and 0. We can express the 
second fundamental coefficients in terms of N and the second derivatives of X, using the 
fact that X T and X^ are perpendicular to N for all (r, <p): 

= (X T -N) T = X TT -N + X T -N T ^L = -X T -N T = X TT -N (234) 
= (X^-N)^ = X H -N + X^N^N = -X^N <p = X^-N . (235) 

Using the remaining two identities 

= (X T -N)^ = X T ^-N + X T -N <t> ^-X T -N lj) = X T<t> -N (236) 
= {Xt-N) T = X^-N + X+-N T ^-Xt-N T = X+ r -N . (237) 



we find 



M = ~{X T -Nt + Xf N T ) — X T( p • N — X^ • N . (238) 



Therefore, the second fundamental form 

// = Ld 2 r + 2Mdrd(f) + Nd 2 <f) 

= X TT ■ Nd 2 r + 2X t4> ■ Ndrd<p + X H ■ Nd 2 <p = d 2 X ■ N , (239) 

where we define d 2 X = X TT d 2 T + 2X T( j ) dTd(f)+X ( f >( f ) d 2 <fi. For the caustic ring, the unit normal 

— # — * 

ft = x r x Xj, = p'z- z' (cos (p x + sin (py) 
and the second fundamental coefficients are 

L =*--*=7^- N ^-'^vyfe' (241) 

M = X T(f> ■ N = . (242) 
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Now, let P be a point on the surface, X = X(t, 0) a patch containing P, and 
X = X(r(t),(f>(t)) a curve C through P where t is an arbitrary real parameter in an 
interval I. For instance, 



8{t) = 





dX 


f 


dt 



dt , 



(243) 



where the constant to & I and t < t, can be used to parametrize the curve. Notice that, 
s(t) is the length of the arc segment of the curve between X(t ) and X(t), hence X(s) is 
called "a natural representation" of a curve. Note also that, a representation in terms of 
arc length is not unique, because, it depends on the chosen initial point to (where s = 0). 
The tangent vector t to C at X(s) is defined as 



-» -». . dX . . 

t = t W .-( S ). 



(244) 



where t is a unit vector since 



dX 
ds 



dX dt 
dt ds 



dX ids 
It I dt 



(245) 



From Eq. 243, however, we obtain 



hence Eq. 245 becomes 



ds 
dt 

dX 

ds 



dX 



dt 

dX 
~dtl 



dX 



dt 



(246) 



(247) 



and, |t| = 1, as claimed. We will define all the geometric quantities along the curve in 

— * 

terms of a natural representation, as in the case of the unit tangent vector t, however, all 
of these quantities can be re-expressed in terms of an arbitrary parameter t, by using the 
chain rule and Eq. 246. 

The first derivative of the tangent vector t(s), denoted by k(s), is called the curvature 
vector on C at the point X(s), hence 



k(s) 



dt / 

dS {s 
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d 2 X 
ds 2 ' 



(248) 



or, using Eq. 246 



dt dt 

ds dt i 



dX 



dt 



(249) 



Since t is a unit vector, k = ^ is orthogonal to t: 



= d(l) = — (t t ) = 2— t . 

ds ds 



(250) 



Therefore k(s) is parallel to the normal plane. The direction of k is the direction in which 
the curve is turning. The magnitude of the curvature vector k = |k(s)| is called the 
curvature of C at X(s). In fact, geometrically, k is equal to the rate of change of the 
direction of the tangent vector with respect to the arc length. To prove the claim let us 
call the angle between the unit tangent t(s) at X{s) and t(s + As) at a neighboring point 
X(s + As) as A#. Since t is a unit vector, |t(s + As) — t(s)| is the base of an isosceles 
triangle with sides of unit length. Hence \t(s + As) - t(s)| = 2 sin (^) = A9 + O ((A6») 2 ). 
Then, 



K = 



dt 




ds 





lim 

As^O 



t(s + As) - t(s) 



or, we may write 



Since 



the curvature 



K 



lim 

As^O 



As 

A9 
As~ 



= lim 

As^O 



A9 + Q((A9) 2 ) 
As 



1 + 



lim AO 

As^O 



lim 

As^O 



A6 
0((A9Y) 



K 



lim 

As^O As 



A# 

A^ d6 



0, 



ds 



(251) 



(252) 



(253) 



(254) 



Thus, along a curve that has a rapidly changing tangent direction with respect to arc length, 
such as a with a small radius, the curvature is large. The reciprocal of the curvature is 
denoted by 

R = - = — , 255 

K \k(s)\ 
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and is called the radius of curvature at X(s). A point on C where the curvature vector 

— * 

k = 0, is called a point of inflection. Thus, at a point of inflection the curvature k is zero 
and R is infinite. If k is identically zero along a curve C, then ^ = 0, and by integrating 
we find, t = ^ = a, where a = const. ^ 0. Thus X = as + b, where a = const, which 
implies that C is a straight line through b, and parallel to a. Furthermore, on a surface, 
we can define a normal curvature vector. The normal curvature vector to C at P, denoted 

— * — * 

by k n , is the vector projection of the curvature vector k of C at P on to the unit normal 
vector N of the surface at P: 

k n = (k • N)N . (256) 

The component of k„ in the direction of N, denoted by n n = k • N, is called the normal 
curvature of C at P. The normal curvature can be re-expressed using Eq. 249 



K. 



kiV 



dt 

di 



N 



dX 



dt 



(257) 



Now, using the fact that t is perpendicular to N along C: 



d_ 
~dt 



t-N)=0 



dt 

di 



• N — —t 



dN 
~dt 



we find 



- dN 



dX 



dt 



dX dN 
dt dt i 



dX 



dt 



(258) 



(259) 



— * 

where we replaced t using Eq. 247. The above equation can be written more explicitly as 

, (260) 



- dr d$ 

XT Tt +x *Tt 



A dr - dd) 



- dr # 

X Ut +X *Tt 



- dr - dcf) 

XT Tt +x *Tt 



and hence 



L(dr/dt) 2 + 2M(dr/dt)(d(f)/dt) + N(d(j)/dty 



Kn = E(dr/dt) 2 + 2F (dr/dt)(d<P/dt) + G(d<P/dt) 2 ' 
Observe that function of dr/dt and d<p/dt, depends only upon the ratio 

(dr I 'dt) I \d(f) I 'dt) , namely the direction of dr/dcfr, hence 



K, 



L(dr) 2 + 2M(dT)(d<p) + N(tZ0) 2 
E(rfr) 2 + 2F(dr)(#) + G(ci0) 2 
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77 

T 



(262) 



where (dr) 2 + (d(f>) 2 7^ 0. The two perpendicular directions on the surface, for which 
the K n take maximum and minimum values, are called "the principal directions," and the 
corresponding normal curvatures, K\ and k 2 , are called "the principal curvatures." If K n 
has a maximum or minimum, k*, for (dr*,d<j)*), then 



dK r , 



ddr 



I II dT ~ II I, 



dr 



ddcj) 



(dr*, d<f>*) 



(dr*, d<f>*) 



P 

I II drh - II I c] 



P 



(dr*, d<f>*) 



(dr*, dip*) 







(263) 



Multiplying both equations by /, and using the identity 

K n \(dT*, dip") — K 



II 

T 



(dr*, d<f>*) 



we obtain 



{Ildr — K*Id T )\(dT*, d<f>*) — , {Hd<t> — K*Id4)\(dT*, d<f>*) — . 



(264) 



(265) 



Since II dr = 2(L dr + M d(f>), I dr = 2(E dr + F d(f>), II drj> = 2(M dr + N defy and 
I d4> = 2(F dr + G dtp), Eq. 265 yields 



(L dr* + M d(f)*) - k*(E dr* + F d0*) = , 
(M dr* + N d(f)*) - k*(F dr* + G d<j>*) = . 



(266) 



The above is a homogeneous system of equations and will have a nontrivial solution 
(dr*,d(p*) (or nontrivial principal direction dr*/d(fi*) if 



det 



L — /t*E M - k*F 
M — k*¥ N - k*G 



. 



(267) 



Therefore, by expanding, we find that a number k is a principal curvature, if it is a solution 
of the equation 



EG - F )k - (EN + GL - 2FM))k + (LN — M ) = . 



(268) 
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The discriminant 5 of Eq. 268 is 

5 = (EN + GL - 2FM) 2 - 4(EG - F 2 ) (LN - M 2 ) , 

= (EN - GL) 2 + 4(EM - FL)(GM - FN) 
'EG - F 2> 



E 2 



2F 

EM - FL) 2 + ( EN - GL - —(EM - FL 



(269) 
(270) 



The discriminant 5 > since EG - F 2 > (Eq. 224). We have 6 = if and only if 
EM - FL = and EN - GL - f^(EM - FL) =0, which implies EM - FL = and 
EN — GL = 0, or equivalently, ^ = ^ = ^ = A. Thus, Eq. 268 has either a single 
real root, k — j, if 5 — 0, or two distinct real roots, K\ and k 2 . Dividing Eq. 268 by 
EG - F 2 > 0, we obtain 



K 



2Hk + K = , 



(271) 



where 



\ t , EN + GL - 2FM 

^2 (Kl + K2)= 2(EG-F 2 ) 



(272) 



is the average of the principal curvatures and is called the mean curvature at P, and 

LN-M 2 



K 



KiK 2 = 



(273) 



EG — F 2 ' 

is the product of the principal curvatures and is called the Gaussian curvature at P. Hence 
the roots of Eq. 271 are 

«i, 2 = H ± VH 2 -IC , (274) 



EN + GL - 2FM ± ^(GL - EN) 2 + 4(EM - FL)(GM - FN) 

Kl ' 2 " 2(EG - F 2 ) • (275) 

For the caustic surface, we have F = M = 0; therefore 

IGL-EN 



«1,2 



L N 

E + G ± 



EG 



(276) 



and hence, we choose «i = ^ and k 2 



When explicitly calculated in terms of the 



caustic parameters, they yield 



L 

Kl = E = 



1 b 1 



p'z" - z'p" 

(p' 2 + Z' 2 f' 2 ~ ^2'y/Vsp 



T 


-l 








3-4— 




1 - 


us J 




r o. 











-3/2 



, (277) 
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N z' 1 

K 2 = 7T — / = ± - 

G pJp< 2 + z' 2 P 



where p is given explicitly in terms of the parameters in Eq. 179. Then, we can verify the 

Gaussian curvature found in Eqs. 232: 

z'jp'z" - z'p") 

/C = KiK 2 = , . rr^r = 7 \ • (279) 

P(P' 2 + Z' 2 ) 2 p (3 - 4^ J Mr - r) + 6 2 r) 2 

The principal curvatures, K\ and K2, are the inverse of the principal curvature radii 
Ri and R 2 . The lensing by a caustic surface depends on the curvature radius R of the 
surface along the line of sight. Therefore, we will need the curvature radii of the caustic 
ring surface at an arbitrary point (tti(r 1 ),r 1 ) on the surface. R is given by the Euler's 
theorem: 

l= (cOS " )2 + ^g, (280) 
R R\ R 2 

where 

fl l(Tl )= 1 =-2^./^|3-A (i_(i_£)Ziy (281) 
in the direction of the cross-sectional plane of the caustic ring, 



R 2 (r 1 ) = ^ = ±^J^-l + -(a + ^r -r 1 )(r -2r 1 )) , (282) 

in the direction perpendicular to the cross-sectional plane, and u is the angle between 
the line of sight and the direction associated with R 1 . In this dissertation, we adopt the 
convention that R is positive (negative) if, along the line of sight, the surface curves toward 
(away from) the side with two extra flows. If R is positive, the surface is called "concave" 
(Fig. 12). If R is negative, the surface is called "convex" (Fig. 13). In Eq. 282, the + sign 
pertains if < ^ < |; the — sign pertains if | < ^ < 1. R± is always negative except at 
the three cusps, where it vanishes. R 2 diverges at the cusp in the z = plane. 

For < f < f , there is a pair of lines of sight for which the curvature vanishes. They 
are at angles: 

lu = ± arctan \ — — ^ (283) 
V R\ 
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relative to the cross-sectional plane. Gravitational lensing by a fold of zero curvature is 
discussed in Section 4.2.3. 

3.5.5 Density Profiles of Axially Symmetric Caustic Rings 

As we have seen the inner caustics [18] are closed tubes whose cross-section, shown in 
Fig. 3, is a L>_ 4 catastrophe [16]. They are located near where the particles with the most 
angular momentum in a given inflow are at their distance of closest approach to the galactic 
center. For simplicity, we study caustic rings which are axially symmetric about the z- 
direction as well as reflection symmetric with respect to the z = plane. In galactocentric 
cylindrical coordinates, the flow at such a caustic ring is described by Eqs. 171 and 173. 
Recall that the physical space density is given by Eq. 165: 



The parameters atj(p, z) and Tj(p,z), with j — 1, . . .,n, are the solutions of p(a,r) = p 
and z(a,r) = z, where n(p,z) is the number of flows at (p,z). Outside the caustic tube 
n — 2, whereas inside n — 4. The determinant D 2 (a, r) is defined in Eq. 175. In the limit 
of zero velocity dispersion, the density of dark matter particles is infinite at the location of 
caustic surfaces. Thus, the location of the caustic ring surface is obtained by demanding 
that D 2 {a,r) = in Eq. 176, which implied a(r) = ± v / f r ( r o = r ) wi th < ^ < 1 (Eq. 
177). Hence, we obtained p(r) = a + |(r — t )(2t — r ) and z{t) = ±^a/s r3 ( r o ~~ r ) i n 
Eqs. 179, for the location of the tricusp outline. 

Near the surface of a caustic ring, but away from the cusps, the density profile is that 
of a simple fold: d(a) = -^G(cr), with a > inside the tricusp. Next, we calculate the 
fold coefficient A at arbitrary points other than the cusps, on the surface of a caustic ring. 
As a warm-up, we start with a special point, marked by a star in Fig. 10. We then obtain 
the density profile near the cusps. 




(284) 



where we define 



dM 



as the mass falling in per unit time and unit solid angle. 
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A sample point 

As an example, we determine the fold coefficient A at (p, z) = (a, 0) (Fig. f 0). 



z 




Figure f 0: Cross-section of a caustic ring in the case of axial and reflection symmetry. The 
transverse dimensions in the p and z directions are p and q, respectively. The ring radius 
is a. The star indicates the sample location (p ~ a, z — 0). For clarity, we took p, q ~ a. 
For actual caustic rings, p, g < a. 



Setting a = z = 0, a = p — a = \u{t — t ) 2 and r ~ r , we have 

\D 2 (r)\ ~2b^ . 

Including the factor 2 for in and out flows, Eq. 284 yields 

dM 1 1 

d(<T) 



(285) 



dVtdt ab y/pa 
d 2 M 1 1 



(286) 



Therefore, 

dVtdt ab ^fp ' 

where the subscript is used to indicate that A is being evaluated at the sample point. 



(287) 



To obtain a numerical estimate for A , we again use the self-similar infall model 
[50, 51, 48, 49] with e = 0.2. For the nth ring, we have [18], 

'0.7\ / v vot 



{a n : n = 1, 2, . . .} ~ (39, 19.5, 13, 10, 8, . . .)kpc 



Ji- 



0.27 J V h J V220km/s 



, (288) 
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where j max is a parameter, with a specific value for each halo, which is proportional to the 
amount of angular momentum that the dark matter particles have [48, 49]. Also, 



d 2 M 



= fnv n ^ , (289) 



2 



dQdt 

where v n is the velocity of the particles in the nth caustic ring, and the dimensionless 
coefficients /„ characterize the density of the nth in and out flow. In the self-similar model 
[18], 

{/„ : n = 1, 2, . . .} ~ (13, 5.5, 3.5, 2.5, 2, . . .) • KT 2 (290) 

for e = 0.2. The f n are like the F n in Eq. 158, but they describe the nth in and out flow 
near the caustic ring, whereas the F n describe that flow near turnaround. 
Combining Eqs. 287 and 289, we have 



2 

4),n = ^^-L ■ (291) 

It was shown in reference [16] that b n and v n are of the same order of magnitude. Moreover, 
Sikivie [23] interpreted the ten rises in the rotation curve of the Milky Way as the effect of 
caustic rings. In that case, the widths p n of caustic rings are determined from the observed 
widths of the rises. Typically one finds p n ~ 0.1 a n . Using this and v n ~ b n , Eq. 291 yields 
the estimates 

10~ 4 gr 



{4>,n:n = l,2,...}~(3, 4, 4, 5, 5,...) 



cm 2 kpc 1 / 2 



0.27 N 



JmaJ V0.7; V220km/sy V ; 

At the point under consideration, (p, z) = (a, 0), the surface of the caustic ring is convex 
for all lines of sight (i.e., all tangents at that point are on the side with two extra flows). 
If the line of sight is in the z = plane, the curvature radius is a. If the line of sight is 
perpendicular to the z = plane, the curvature radius is 2^p. Lensing by a convex caustic 
surface is discussed in Section 4.2.2. 
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To obtain the lensing properties of the caustic ring surface at an arbitrary point, we 
need the curvature radii R given in Eq. 280 and the coefficient A at all locations. We 
derive A everywhere on the surface in the next two sections. 

The fold coefficient everywhere 

We choose an arbitrary point on the tricusp (i.e., on the surface of the caustic ring). 
Its parameters are (eti, t±) with ai given in terms of T\ by Eq. 177. The physical coordinates 
(pi,zi) are given in terms of T\ by Eqs. 179. We assume that the point is not at one of 
the three cusps. The latter are located at T\ = 0, at T\ = |r with ct\ > 0, and at T\ = |r 
with a± < 0. 

The vanishing of D 2 (ti) = detP(ri) implies the existence of a zero eigenvector of the 
matrix 

*™ = ( } £ ) ■ (-) 



Let us define 6 l i(r 1 ) such that 



cos (6 



v(n) [ sm Yni ) = . (294) 



We have 

sin(gi) = — = bn and cos (gQ = -= SCXl . (295) 

v/(6n) 2 + ( sai y v/(6n) 2 + (s ai y 

We define new Cartesian coordinates (a, 77) related to (p — pi, z — z\) by a rotation of angle 

0i + f: 

\ ( -c-os(g| ) \ ( (> - pi 1 ^ g6 ^ 



77 J V cos(^i) — sin(^i) y V 2 — 
We now show that a is the coordinate in the direction orthogonal to the caustic surface at 

(Pl,Zl). 
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Consider small deviations about (cci, T\) in parameter space: (a, r) = (ai + Aa, T\ + 
At). Equation 175 implies 

{%)= (ir) + 0(A ° 2 ' A ^ AaAT) ' (297) 

where T indicates transposition. The expansion of a in powers of Aa and At yields 

a = 0(Aa 2 , At 2 , AaAr) , (298) 

because the first order terms vanish: 



da 
da 



Aa+ — 



AT = -(An Ar)P(n)( ! = . (200) 



(aiTi) 

The fact that a is second order in Aa and At shows that a is the coordinate in the 
direction perpendicular to the caustic surface, and 

g — — sin {9\)p — cos (0\)z (300) 

is the unit normal to the surface, pointing inward. 6{ji) is the angle between the p axis 
and the tangent line at (pi,zi) (Fig. 11). To obtain the density profile of the caustic near 
the point under consideration, we need D 2 (rj, a) to order ^fo. So we calculate a to second 
order in powers of Aa and At, and D 2 and r\ to first order. We find 

o = \{Aa At) ( f n ^\ - bC ° S %\ )( Aa ), (301) 
2 V 1 \ -6cos(6>i) -usin(^i) J \ At J ' v 1 

and 

D 2 = -6 [2saiAa + «(2ri - r ) At] 
i] = [u{ji — To) cos^i — 6a i sin#i]Ar — [sa± cos#i + 6ti sin#i]Aa . (302) 

Eqs. 302 can be inverted to obtain Aa and At as functions of D 2 and 77. When the result 
is inserted into Eq. 301, we obtain 

.2„2 



6 If ,D 2 2 ustqt^ti 

= 2 s /( b r 1 f + (s ai? u Tl[i r s> -ArA Tl[ - ) " 1 ' (3 ° 3) 
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t 

z 




Figure 11: An arbitrary point on the tricusp is labeled by T\ with ol\ = oti(T\). Its physical 
coordinates are (p 1 , z\). A new Cartesian coordinate system (a, r/) is defined there such that 
a is perpendicular to the caustic surface. It is rotated relative to the (p, z) cooordinates 
by an angle 9{ti) + f . 



This implies 



D 2 (r), a) = x j f_ V 2 + Wu\3t - ^^(ro-rOn + r? a . (304) 

V us ) TO 



Along the a direction (77 = 0), we have 



D 2 (a) = 2C(r 1 )b y /^ , (305) 

with 



C(n) = ,/|3-4Zl|J" ( i-Zl)Zl + ( S ) . . (306) 

T V r r TO 

Combining Eqs. 284 and 305, and minding the factor of two because two flows contribute, 
we have 

d{T 1 ,a) = -^-Q{a) (307) 



with 



A(r) _ d2M 1 cos(ai) 

1 1J dQdtbCin)^ pin) ' 1 
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In terms of A 0n = A n {ji = r ), for which estimates are provided in Eq. 292, we have 

A n (n) = A^ n . (309) 

Note that A(t±) diverges at each of the three cusps because C vanishes there. The caustic 
ring parameters (a, r, b, u, s) are related to the velocity distribution of the flow at last 
turnaround [16]. 

Density near a cusp 

In this section, we derive the dark matter density profile near a cusp. For the sake of 
convenience, we choose the cusp in the z = plane at p = a + p = p , where a = r = 0. 
Very close to the cusp, we may neglect the term of order r 2 in Eq. 173. Hence, 

z = bar , p = p — ut t a 2 . (310) 

2 

The term of order a 2 cannot be neglected. We define new dimensionless quantities [16]: 

A=-fi, T = — , X= P -^, Z= Z -^, (311) 

70 V U T p p 

where ( = p 1 . In terms of these, Eq. 310 becomes 

Z = 2AT , (312) 
X = -2T - A 2 . (313) 

The determinant of the Jacobian in Eq. 176 becomes 

D 2 (A, T) = 2bp [T - A 2 ) . (314) 

Substitution of Eq. 312 into Eq. 313 yields the third order polynomial equation: 

A 3 + XA + Z = . (315) 

The discriminant is: 

Z\ 2 /X x 3 



' = I T + la- 1 ■ (»») 
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If 5 > 0, the cubic equation has one real root, and two complex roots which are complex 
conjugates of each other. If 5 < 0, all the roots are real and unequal. For 5 = 0, all 
the roots are real and at least two are equal. The number of real roots is the number 
of flows at a given location. The tricusp has two flows outside and four inside. In the 
neighborhood of a cusp, however, one of the flows of the tricusp is nonsingular and does 
not participate in the cusp caustic. To include the root corresponding to the nonsingular 
flow near (z,p) = (0,p ), one must keep the term of order r 2 in Eq. 310. 

The equation for the caustic surface in physical space is 5 = 0. Indeed, Eq. 314 
implies that at the caustic T = A 2 . Therefore, Z = 2T 3 / 2 and X = — 3T. A straightforward 
calculation shows that 

5 = -^M 4 + 7A 2 T-8T 2 ) . (317) 

Thus D 2 = implies 5 = 0, however, the converse is not true: 5 = does not imply D 2 = 
because not all flows at the location of the caustic surface are singular. 
Eq. 284 for the density becomes 



where the sum is over the flows (i.e., the real roots of the cubic polynomial Eq. 315). If 
5 > 0, the one real root is 



This describes the one flow outside the cusp. Using Eqs. 313 and 319 in Eq. 318, we 
obtain 




(318) 



A = (~ + v^) 1/3 + (-f -V5)^ 



(319) 



1 d 2 M 1 



(320) 



p bp dQdt \X - 3(-f + V5) 2 / 3 - 3(f + V5) 2 / 3 



Just above or below the cusp, where X = and \Z\ <C 1, we have 



d = 



1 d 2 M 1 



(321) 



3bpp dttdt\Z\ 2 / 3 



92 



On the other hand, if we approach the cusp in the plane of the ring from the outside, where 
Z = and < X < 1, we find 

1 d 2 M 1 , N 

b p dfldt (p — p ) 

Next, we calculate the density inside the cusp, where 5 < 0. The three real roots of the 
polynomial Eq. 315 are 



.4, ;—^ vo> (323) 



A 2 ^/-^cos(0 • |li (324) 



A 3 = 2y^cos(^ + y), (325) 

where cos 36* = — ^ (— -|) and < < |. Inserting them into Eq. 318 and using Eq. 
313, we obtain 

1 d 2 M ( 1 \ / 1 1 1 

d = 1 ~ T77/T7 ^7 7 77777; 7 + 7777 ^7 7 + 



bppodVldt \-X J \4cos 2 (6) - 1 4cos 2 (# + f)-l l-4cos 2 (# + f) 

(326) 

where each term in the parentheses corresponds to one of the three flows. Adding the 
individual flow densities yields 

d _ 2 d 2 M 1 1 
~ bpp dttdt\X\ (v/3-tan0) sin 29 

If we approach the cusp in the galactic plane from the inside, where Z = and < —X <C 
1, we have 

2 d 2 M 1 /oooN 
b po dilcft (po — p) 

which is twice the result in Eq. 322. The gravitational lensing properties of a cusp are 
derived, for a special line of sight, in Section 4.2.4. 
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4 Gravitational Lensing by Dark Matter Caustics 

Gravitational lensing techniques have proven useful in studying the distribution of dark 
matter in the universe. They have been used to reveal the existence of massive compact halo 
objects (MACHOs) in galaxies [10, 11, 12, 13], and to constrain the mass distribution in 
galaxy clusters [56, 57, 58]. In this section, we calculate the gravitational lensing properties 
[15] of dark matter caustics [18, 16, 19]. Gravitational lensing by dark matter caustics has 
been discussed by Hogan [22]. We confirm Hogan's results for the case he considered, 
called the "concave fold" in our nomenclature. We study additional cases and introduce a 
formalism to facilitate the calculations. 

The gravitational lensing effects of a caustic surface are largest when the line of sight 
is near tangent to the surface because the contrast in column density is largest there. The 
effects depend on the curvature of the caustic surface at the tangent point in the direction 
of the line of sight: the smaller the curvature, the larger the effects. A caustic is an oriented 
surface because one side has two more flows than the other. We will consider three cases 
of gravitational lensing by a smooth caustic surface. In the first case, the line of sight is 
near tangent to a caustic surface which curves toward the side with two extra flows, as 
in Fig. 12. We call such a surface "concave." In the second case, the surface is "convex" 



Figure 12: Lensing by a concave fold. The arc is the intersection of the caustic surface with 
the plane containing the normal (z) to the surface and the line of sight (y). The shaded 
area indicates the side with the two extra flows. 



A 



z 




> 
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(i.e., it curves away from the side with two extra flows as in Fig. 13). In the third case, the 

z 




Figure 13: Lensing by a convex fold. Same as Fig. 12 except that now the caustic surface 
curves away from the side with two extra flows. 

caustic surface has zero curvature at the tangent point (the radius of curvature is infinite), 
but the tangent line is entirely outside the side with two extra flows. Caustic surfaces may 
have cusps. The outer dark matter caustics of galactic halos are topological spheres which 
have no cusps, but the inner dark matter caustics of galactic halos are closed tubes whose 
cross-section is a _D_ 4 catastrophe (tricusp; Fig. 10). The fourth case of gravitational 
lensing which we consider has a line of sight near a cusp, and parallel to the plane of the 
cusp (Fig. 14). 

Gravitational lensing produces a map of an object surface onto an image surface. 
The magnification is the inverse of the Jacobian of this map. Because dark matter caustics 
have well defined density profiles, it is a neat mathematical exercise to calculate their 
gravitational lensing characteristics. The images of extended sources may show distortions 
that can be unambiguously attributed to lensing by dark matter caustics in the limit of 
perfect observations. We see that in three of the cases considered, a point source can have 
multiple images. In those cases, when two images merge, the Jacobian of the map vanishes 
and the magnification diverges. So, at least in theory, it seems that gravitational lensing is 
a very attractive tool for investigating dark matter caustics. Observation of the calculated 
lensing signatures would give direct evidence for caustics and CDM. 
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Figure 14: Lensing by the caustic ring cusp at (z,p) = (0,po), with line of sight parallel 
to the z = plane. We define x = p — p . A) Side view in the direction of the line of 
sight. The latter is represented by the dot near x = z = 0. B) Top view. The curve is the 
location of the cusp in the z = plane. 

We have been particularly motivated by the possibility [22] that the observer might 
be able to distinguish between axions and WIMPs by determining the distance over which 
the caustics are smeared. The nearby caustic, whose position is revealed according to 
ref. [23] by a triangular feature in the IRAS map of the Milky Way plane, is only 1 kpc 
away from us in the direction of observation. By observing the gravitational lensing due 
to that caustic, one may be able to measure Sx as small as 10 13 cm, assuming an angular 
resolution of 3 • 1CT 9 radians. If 5x turned out to be that small, the WIMP dark matter 
hypothesis would be severely challenged (Eq. 16). Unfortunately, as will be shown below, 
the gravitational lensing due to a caustic only a kpc away from us is too weak to be observed 
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with present instruments. It is well known that gravitational lensing effects are proportional 
to Dl q s ls where D S ,D L and D LS are respectively the distances from the observer to the 
source, from the observer to the lens, and from the lens to the source. We see below that, 
for the gravitational lensing effects of dark matter caustics to be observable with present 
technology, the lenses and sources must be as far as possible, at the cosmological distances 
of order Gpc. Even then, the observation of such effects will be difficult. Unfortunately, at 
Gpc distances it is not possible to measure Sx as small as Eqs. 15 and 16 with foreseeable 
technology. So it seems unlikely that one will be able to distinguish between dark matter 
candidates on the basis of the gravitational lensing characteristics of the caustics they form. 
Henceforth, unless otherwise stated, the velocity dispersion is set equal to zero. 

4.1 General Formalism 

The first part of this section gives a brief account of the gravitational lensing formalism 
[59] . In the second part we show how this formalism can be streamlined in the special case 
of lensing by dark matter caustics. 

In linear approximation, the deflection angle 9 of a light ray due to a gravitational 
field is given by 




(329) 



where <3> is the Newtonian potential. We choose the y-axis in the direction of propagation 
of light. The deflection angle 9 is related to the angular shift £/ — £ s on the sky of the 
apparent direction of a source: 

= - &) (330) 

where D s and D LS are the distances of the source to the observer and to the lens respec- 
tively. £s is the angular position of the source in the absence of the lens while is the 
angular position of the image with the lens present. The angles carry components in the 
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x and z-directions: 6 = (9 X ,9 Z ), £ = (£e,£ 2 ), etc. Unless otherwise stated, we mean by a 

— # 

vector a quantity with components in the x- and z-directions. We have x = (x, z) = Dl^j. 
It is convenient to introduce a 2D potential 

Kto-wOikh* • (331) 

so that 

= ^VfrlKS) , (332) 

where = -DlV, and -Dl is the distance of the observer to the lens. Then, Eq. 330 
becomes 

S = & + ^(£) • (333) 

It gives the map £s(£/) from the image plane to the source plane. The inverse map may 
be one to one, or one to many. In the latter case, there are multiple images and infinite 
magnification when a pair of images merge. 

Our problem is to find the image map of a point source when the matter distribution 
is given. The 2D gravitational potential ip obeys the Poisson equation: 

where £(£r x ,£r z ) is the column density (i.e., the integral of the volume density along the 
line of sight): 

S(5) = J 'dyd(D L S Ix ,y,D L S Iz ) , (335) 
and S c is the critical surface density 

E -«ik^ 347g/cm2 G^ G ^ < 336 > 

A uniform sheet of density S c focuses radiation from the source to the observer. 
Equation 334 is solved by 

*& ) = lkj ^J^HS-Sl , (337) 
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and hence, 

— * — * 

^= 5 - is = V^(£) = -±rf d% S(g) J J " ^ . (338) 

The image structure, distortion, and magnification are given by the Jacobian matrix of the 
map £s(£j) from image to source: 

K V = 7^ = ~~ ^3 ' ( 339 ) 

where = ggpogp - Because gravitational lensing does not change surface brightness, the 
magnification M. is the ratio of image area to source area. Therefore, 

To first order, for ^ <C 1, 

M = 1 + V^V = 1 + 2 |" • ( 341 ) 

£, 



c 



To obtain the largest lensing effects, we wish to minimize S c , given in Eq. 336. For fixed 
Ds, the minimum occurs when the lens is situated half-way between the source and the 
observer. Also, Ds should be as large as possible. For our estimates, we will assume that 
the source is at cosmological distances (e.g., 2D L = 2D LS = D s = lGpc, in which case 
S c = 1.39 g/cm 2 ). 

For a general mass distribution, the gravitational lensing effects are obtained by first 
calculating the column density, Eq. 335, and then the image shift, Eq. 338. This procedure 
can be simplified when the lens is a dark matter caustic. We are interested in lines of sight 
which are tangent to a caustic surface, because the column density £ has the highest 
contrast there. We assume that, in the neighborhood of the tangent point, the flow is 
independent of y except for a shift x(y) of the caustic surface with y. The density can then 
be written as 

d(x,y,z) = d(x - x(y),z - z(y)) , (342) 
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where d(x, z) is the density of the 2D flow in the plane orthogonal to the line of sight. The 
flow in the y-direction is irrelevant because lensing depends only on the column density E. 

In the limit of zero velocity dispersion, a 2D flow is specified by giving the spatial 
coordinates x(a,/3,t) of the particle labeled (oi,/3) at time t, for all (a,/3,t). The labels a 
and (5 are chosen arbitrarily. At a given time t, x(a,/3,t) = x has a discrete number of 
solutions (a, (3)j(x,t) labeled by j — 1, . . .,n(x,t), where n is the number of distinct flows 
at (x,t). Here, t is the time at which the light ray passes by the caustic. Henceforth we 
will not show the time dependence explicitly. The particle density in physical space is 



n(x) 2 



d(x,z) = J2 



(343) 

(a,/9)=(a,/?),(a) 



dadd I gfef) 

3=1 ' \d(a,(3)\ 

where A is the mass per unit length in the direction of the line of sight and is the A 
density in parameter space. Inserting Eqs. 335 and 342 into Eq. 338 we obtain 

VfrlKS = x/D L ) = ±-J dy j d 2 i\ | J ~^ d(DU' Ix - x(y), D L £ Iz - z{y)) . (344) 

When d is replaced by the caustic density Eq. 343, Eq.344 becomes 



VtMb = x/ d l) = 

1 d 2 A 



n^ c D L j dy j dX ' dZ ' (x-x') 2 ^\^l\dad(3 

c L J J V ' j=l I d{a,P) I y 



. (345) 

(a, / 9)=(a^) J .(x'-x(j,)) 

Changing variables from (x 1 , z') to (a, f3) and assuming that the density in parameter space 
varies only slowly over the region of integration, we rewrite Eq. 345 as 

^ x 1 d 2 A f , f , x — x(a, 3) — x(y) . . _ \ 

V^(6 = = ——jiy] iadP |g _ ^ ■ W_ ^ . (346) 

Further simplification is achieved by defining the complex integral: 

J(x, z) = f dadd t — — — — t — -r- , (347) 

v ' J x - x(a,/3) +i(z - z(a,/3)) v ' 
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in terms of which the shifts are given by 



dib 1 d 2 A 

A& = -J- = Re / dy I(x - x{y)) , (348) 



d^j x 7rT, c D L dadj3 

^ = ^rr-^k^ lm I dvI{1 -" {v)) ■ (349) 

Eqs. 347 and 349 are useful when the caustic has contrast in the two dimensions transverse 
to the line of sight (e.g., near a cusp). 

In many applications, however, the caustic has contrast in only one of the dimensions 
transverse to the line of sight. Choosing x to be the trivial direction, Eq. 334 is reduced 
to 

||(&) = ^ E(* = DUj) , (350) 
and the column density is given by 

= J dyd(z-z(y)) . (351) 
The flow is now effectively one dimensional. Its physical space density is given by 



d(z) = J2 



n(z) dA 1 



da 1^1 

j=\ I da 1 



(352) 

a=a j (z) 



where A is the mass surface density in the two trivial directions (x and y) and is the A 
density in parameter space. The ID Green's function is G = |(|^| + a£) + 6, where a and 
b are arbitrary constants. The shift is 

jjrfa) = %j d ^ ^i) (Sign(6 - Q + a) . (353) 

The constant a causes an overall shift of the image, which does not concern us. We choose 
a = —1. Repeating the steps done earlier for the 2D case, Eq. 353 is re-expressed as 

A£ = j dy j da B(-z + z(a) + z{y)) . (354) 

In the next section Eq. 354 is used to calculate the shifts due to simple folds of dark matter 
flows, and Eq. 349 is used to calculate the shifts due to a cusp. 
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4.2 Applications 

4.2.1 Lensing by a Concave Fold 

We consider a simple fold caustic which has a curvature radius R along the line of sight. 
We assume that the surface is concave (i.e., it is curved in the direction of the two extra 
flows; Fig. 12). The outer caustics of dark matter halos are examples of concave caustic 
surfaces. The convex case is discussed in the next subsection. 

In the neighborhood of the point P of closest approach of the line of sight with the 
caustic surface, we choose coordinates such that z is perpendicular to the surface while x 
and the direction y of the line of sight are parallel. P has coordinates x — y — z — 0. In 
the y = plane, the flow is given by z(a) = —^ha 2 , where h is a positive constant. Using 
Eq. 352 we find the density in the y = plane: 

d(a) = A ^2 , (355) 

where a = — z, and 

/2 dA , or „, 

A =ihT a ■ < 356 > 

2 

For ?/^0, the density is still given by Eq. 355, but with a = z{y) — z and z(y) = — 
We calculate the shift using Eq. 354: 



/ dy daOi- 

e(-6)6 • (357) 



Z c D L daJ_^ m J V 2 2R 

4vrdA [R 
Y* r da\ h 



Hence 



with 



AZ = Zi-Zs = vtie(-Zi) , (358) 



2nAV2R 

V = — • (359) 
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One can also obtain this result by calculating the column density: 



dy d(y, z = f j) 



dy 



= 7iAV2RQ(-£ I ) 



(360) 



2R 



and solving Eq. 334: 



(361) 



Eqs. 360-361 were first obtained by Hogan [22]. The agreement with Eq. 358 validates the 
formalism derived in Section 4.1. Figure 15 plots the source position £ s versus the image 
position £j. 




Figure 15: Source position £s as a function of image position £/ for lensing by a concave 
fold. 77 is given by Eq. 359 in terms of the fold coefficient and the curvature radius of the 
caustic surface. Estimates of rj for the outer caustics of galactic halos are given in Eq. 363. 



The magnification is: 



M = = 1 + v ©(-0) + 0(^ 2 ) 



(362) 
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When the line of sight of a moving source crosses the surface of a simple concave fold, the 
component of its apparent velocity perpendicular to the fold changes abruptly. Also, a 
discontinuity occurs in the magnification of the image. Both effects are of order rj. 
We estimate i] for the outer caustics of galactic halos using Eqs. 156-159: 



{Vn = ^k^ :n=l,2,...}~(7, 6, 6, 6, 6, . . .) • 1(T 3 



G Ti c R n 



DlDlS \ ( Vrot \ (h_ 

D s Gpcy V220km/ S y V0.7 



(363) 



A magification of order 10~ 2 seems hard to observe, however, the images of extended sources 
may be modified in recognizable ways. In particular, straight jets would be seen with an 
abrupt bend where their line of sight crosses a fold. Indeed, the image is stretched by the 
factor 1 + rj in the direction perpendicular to the caustic, on the side with the two extra 
flows. If the jet makes angle a with the normal, it appears bent by an angle 5 = \r\ sin (2a). 
Searching the sky for bends in extended sources may be a realistic approach to detecting 
caustic structure in galactic halos [22]. 

4.2.2 Lensing by a Convex Fold 

By definition, a convex fold is curved in the direction opposite to the side with two extra 
flows (Fig. 13). Using the conventions of the previous subsection, we write the equation 

2 

for the displacement of the surface along the line of sight as z(y) — |^ , and that for the 
flow as z(ct) = —\ha 2 , for small z, with R,h>0. Eqs. 355 and 356 still apply, with 
a = y 2 /2R - z . Eq. 354 yields the shift: 



^ = -^s{ e ^f* +e(&) £Ur*}vs@- Di& ) ' (364) 

We introduce a cut-off L for the integral over large y. L can be thought of as the length 
scale beyond which our description of the flow is invalid. The above equation becomes 

^ = 4= \^ V-2RD Lil - 2RD Lil In ( ^^-2RD Lil 

Y> c D L da^hR\ V ^ y/2RD L \^\ 

(365) 
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By expanding Eq. 365 in powers of ^, and using Eq. 356, we obtain the ^/-dependent 
shift: 



7T 



In 



RD L \ii 
2L 2 



6 



where 77' (like r] in Eq. 359) is given by 



2nAV2R 



(366) 



(367) 



The magnification is 



M 



1 + 1 In 

7T 



RDl\Si 

2L 2 



(368) 



The cut-off L has an effect on both the magnification and the elongation of the image in 
the direction normal to the caustic surface, but that effect is £/ independent. L has a global 
effect on the image, as opposed to an effect localized near = 0. Figure 16 plots £ s versus 
It shows that a convex fold can cause a triple image of a point source. In particular, 
when the source is exactly behind the caustic (£g = 0), the images are at = — £ c , 0, and 
+£ c , with 



2L 2 



7l 



exp 



1) 



(369) 



RD L - r v 77' 

A point source has a single image sufficiently far from the caustic, say at £ n > 0. When the 
line of sight approaches the caustic surface tangent point, two new images appear on top of 
each other at £72 = £73 = — £ c /e. At that moment, the magnification at £12 is infinite, and 
£,s — v'ic/^- As the source crosses the caustic, £/ 2 moves toward £n and finally merges 
with it. When £ n = £/ 2 = +^c/e, the magnification diverges again. After that, only the 
image at £73 remains. 

Let us apply the above results to the line of sight in the plane of a caustic ring at the 
sample point, (z — 0, p — a), discussed in Section 3.5.5. Setting R = a n and using Eq. 291 
we obtain for the nth ring 



V'n 



2ir A^, n ^2a' n n 



rot 



fn Vn 



V2GY, C y/CLnPn K 
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(370) 




Figure 16: Source position £ s as a function of image position for lensing by a convex 
fold. £ c is given in Eq. 369. 



Using Eq. 292 to estimate A ,„, we find 
K:n = l,2,...}~(5, 4, 4, 4, 4, . . .) • 10' 



2 D L D LS {0.27 



D s Gpc \j 



h 



v rot 



0.7 J V220km/s 



(371) 



For such small values of rj' the angular distance between the triple images is exponentially 
small and unresolvable with present and foreseeable instruments. 

The image of an extended object is stretched in the direction perpendicular to the 
caustic by the relative amount 

'RD L \b\ 



M - 1 



rj 



In 



7T 



2L 2 



(372) 



The image is stretched for £j < £ d , and compressed for £/ > £ d , where 

2L 2 



RD, 



(373) 
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In the case of the sample point (z — 0, p — a) with line of sight in the z = plane, the 
cut-off length L (i.e., the distance in the y-direction over which our desciption of the flow is 
valid), is of order y/2ap. In that case ^ ~ Ap/D L . Since p/D L is the transverse angular size 
of the caustic ring, our description certainly fails for £j > p/D^. So, over the region where 
our calculation is valid, the image is magnified. The effects are generically of order one 
percent, increasing to several percent when £/ ~ 10 -3 £d, for caustic rings at cosmological 
distances. 

4.2.3 Lensing by a Fold with Zero Curvature 

We saw in Section 3.5.4 that the surface of a caustic ring has tangent lines along which 
the curvature vanishes. One may speculate that the lensing effects of a caustic surface are 
strongest when the line of sight is tangent to the surface in a direction of zero curvature, 
because the line of sight stays close to the caustic over greater depths in that case. If the 
line of sight of an observer looking at the outside profile of a caustic ring is at some point 
on the profile in a direction of zero curvature, then the equation for the intersection of the 
caustic surface with the plane containing the outward normal to the surface (z) and the 
line of sight (y) is z(y) = —j^y where U is positive and has dimensions of (length) 3 . In 
such a case, the cubic term in the Taylor expansion of z(y) is absent because the line of 
sight remains everywhere outside the caustic ring tube. We did not calculate U for caustic 
rings, but expect U ~ (kpc) 3 in order of magnitude. The flow is given by z(a) = —^ha 2 
with h > 0, as before, and Eq. 356 holds. 

Using the above expressions for z(y) and z(a) in Eq. 354, we find the shift 



A£ = 




(374) 
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where 

£o = (9.89 |-)4 . (375) 

The magnification is: 

-i 

(376) 

Figure 17 shows £s versus £/. 



/ 






r 



Figure 17: Source position £s as a function of image position £/ for lensing by a fold with 
zero curvature. £ is given in Eq. 375. 

Triple images occur when < £ - Unfortunately, for the zero curvature tangents 
of caustic rings envisaged above, £ is very small. For 2D L = 2D LS = D s = Gpc, A = 
3 ■ 10~ 4 — — — r , and U = (kpc) 3 , one finds £ — 4 • 10~ 17 . Hence the triple images cannot 

cm 2 kpc^ 

be resolved. Also, even at angular distances as small as £j ~ 10~ 9 , the magnification and 
image distortion is only of order 1%. 



M 



1/4 
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4.2.4 Lensing by a Cusp 

In this section, we investigate a line of sight parallel to the plane (z = 0) of a caustic ring, 
and passing near the cusp at p = po (Fig. 14). We use the 2D lensing equations derived 
in Section 4.1. The shifts are given by Eq. 349 in terms of the complex integral / of Eq. 
347. Using Eq. 310, we have 

/ = [ da [ dr j— , 1 i _ : r . (377) 

J J p — a — ^uTq + utqt + ^sa z + %\z — bar) 

Because we are close to the cusp, the term of order r 2 is neglected in the denominator of 

the integrand. In terms of the parameters defined in Eqs. 311, 

I=^ldT r A9 — ; j,rp d ^ ; - ; .- . (378) 



bVC J J-oc A 2 - + 2T + X + i : 
The integration over A yields 

2tt f dT 



b VCJ^<c lll + 2T + x + l z_ 



(379) 



where C = Re yj^- + 2T + X + i-^ > . Near the cusp (X, Z < 1) the terms of order 
T 2 can be neglected since they are unimportant in the denominator of Eq. 378. Equation 
379 becomes then 

I(X,Z) = £L[ dT , (380) 

6 VCi J2T + X + i^ 

where the integration domain is defined by the inequality: 

CZ 2 

T 4 -C(2T 3 + XT 2 ) - < . (381) 

Let us call Tj (i — 1, . . .,4) the roots of the polynomial on the left hand side of Eq. 
381. Near the cusp, three of the roots T; L are near zero and one of the roots is close to 2(. 
Let us call the latter T 4 . In order to find the roots near T = we neglect the quartic term 
and solve the cubic equation: 

X Z 2 

T 3 + — T 2 + — = . (382) 
2 8 
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This equation is also obtained by eliminating A from Eqs. 312 and 313. Hence the solutions 
of Eq. 382 are the values of T for the flows near the cusp. As was discussed in Section 
3.5.5, the number of flows is determined by S, given in Eq. 316. Outside the cusp, where 
5 > 0, there is a single root 7\. Inside the cusp where 5 < 0, there are three roots T 1; T 2 
and T 3 . Figure 18 shows the quartic polynomial, Eq. 381, for various values of X and Z, 
and ( = 1. For a line of sight just outside the cusp, the quartic polynomial is negative 




Figure 18: Graphs of the quartic polynomial P{T) = T 4 — C(2T 3 + XT 2 ) — for ( = 1 
and (X,Z) = (0,0), (0,2), (1/3,0) and (-1/3,0); a) through d) respectively. 

from Ti to T 4 with 




(383) 
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Eq. 380 becomes in that case: 

2tt f 



Z 



mf T ' +x+t v? 



'2T ± + X + i 



7t) 



(384) 



Let us consider the line of sight defined by Fig. 14. Equation 349 gives the shift in 
the direction perpendicular to the plane of the cusp as 

1 d 2 A 



7t£ c .Dl dadr 



Im / dy I(X-X(y),Z) 



(385) 



where X(y) 



is the shift of the cusp as a function of depth. The integral in Eq. 



385 with the integrand given by Eqs. 383 and 384 can be done numerically. Here we only 
give a rough estimate, to determine the order of magnitude and qualitative properties of 
the lensing effects. 

For X = and Z < 1, we have 

1 



2tt 

T 



2-^Sign(Z)|Z| 1 / 3 -4|Z|I 



+ 0(Z) 



(386) 



Because the scaling law X 3 ~ Z 2 holds close to the cusp, we expect Eq. 386 to be valid as 

2 

long as the shift in the x-direction \AX\ < \Z\s. Hence Eq. 386 provides a good estimate 
of /(X -X(y), Z) over a depth 2 Ay with Ay ~ ^/ZptiP \Z\*. Therefore 

/2p p d 2 A 



Since we are near a — 0, 



S C Z?£,6 V C dadr 



\Z\3 Sign(Z) 



(387) 



d 2 A 



1 d 2 M 1 d 2 M 



dadr 27cp dadr po dVtdt 
Using this and Eq. 287, we obtain 



(388) 



A&~7/'|£/*l s Sign(£j*) 



where 



// 



2V2a 



V = 



(389) 



(390) 
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The contribution to the magnification from distortion in the z-direction is therefore: 

-l 



M, 



Figure 19 plots £sz versus £/ 2 : 



(391) 




Figure 19: Source position £j 2 as a function image position £/ z for lensing by a cusp along 
the line of sight described in Fig. 14. £i ~ r/" 3 where 77" is given by Eq. 390. 



For £j z < £i ~ i]" 3 , a point source has triple images. For the nth caustic ring, rj" can 
be estimated using Eqs. 288 and 292, and assuming p ~ 0.1a, ( ~ 1: 



{<: n = 1,2,...}- (1.5, 1, 0.7, 0.6, 0.5, 



0.27 V / h \ « 



10 



-4 Dl D LS 



D s Gpcs 

4 

^rot x 3 



(392) 



Jmax/ VO.7; V220km/ S/ 
At cosmological distances, the typical angular separation between the images is of order 
10~ 12 . Unfortunately, here again the images are too close to be resolved, however, for an 
angular distance £/ ~ 10~ 9 , M z is of order 10%, which may be observable. The lensing 
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properties of a cusp for other lines of sight can be calculated numerically using Eqs. 349 
and 377. The latter equation is replaced by Eqs. 384 and 383, and T 4 = 2£, if the line of 
sight is everywhere outside the region with two extra flows. 
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5 Conclusions 



We reviewed the leading cold dark matter (CDM) candidates: axions and weakly inter- 
acting massive particles (WIMPs) in Section 2. We discussed each candidate's status in 
particle physics and cosmology, its production in the early universe, its energy density and 
velocity dispersion. We found that the velocity dispersions of the candidates are negligible 
on astronomical scales; hence, they indeed are "cold." 

In Section 3, we discussed the appearance of caustics in the flow of CDM particles in 
and out of a galaxy. Caustics are surfaces in space where the density of CDM diverges in 
the limit of zero velocity dispersion. In this limit the particles lie on a 3D sheet in 6D phase 
space. The physical space density is the projection of the phase space sheet onto physical 
space. Caustics occur wherever the phase space sheet folds back in phase space. At the fold 
the phase space sheet is tangent to velocity space, particles pile up, and hence the number 
density, which is the integral of the phase space density over velocity space, diverges. 
Generically caustics are boundary surfaces separating two regions in physical space. One 
region has n flows and the other has n + 2 flows, where n is an odd number. Because the 
flow of CDM particles is continuous, caustic surfaces are topologically stable. There are 
two types of CDM caustics in the halos of galaxies: outer and inner. An outer caustic is 
a simple fold (A 2 ) catastrophe, located on a topological sphere, surrounding the galaxy. 
Outer caustics occur where a given outflow reaches its furthest distance from the galactic 
center before falling back in. We provided estimates of the radii and fold coefficients of 
the outer caustics, based upon the self similar infall model. An inner caustic is a ring (i.e., 
a closed circular tube whose cross-section is a LL 4 catastrophe), with three cusps. Inner 
caustics are located near where the particles with the most angular momentum in a given 
inflow reach their closest approach to the galactic center before going back out. Inside the 
tube there are n + 4 flows. Outside the tube there are n + 2, where n is again odd. We 
gave a detailed analysis of caustic rings in the case where the flow of particles is axially and 
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reflection symmetric and where the transverse dimensions of the ring is tight. In that case, 
the flow, and hence the caustic, is described by 5 parameters: a, b, s, r and u. The precise 
shape of a transverse section of the ring in this limit is shown in Fig. 3. For the inner 
caustic rings, the curvature radii and fold coefficients depend on the position on the surface 
of the ring caustic. We derived formulae for these quantities as a function of this position 
(parametrized by Ti) and of the five caustic ring parameters (a, b, u, r , s). A great 
fraction of the surface of a caustic ring (0 < |^| < |) is saddle-shaped and therefore has 
tangent directions along which the curvature of the surface vanishes. Finally, we derived 
the mass density profile near a cusp caustic. These quantities were estimated using the 
self similar infall model. 

Gravitational lensing was discussed in Section 4. In general, the shift in image position 
is the gradient of a potential whose 2D Laplacian is the column density. For an arbitrary 
mass distribution, the procedure for calculating the shift involves two steps. First the 
matter density is integrated along the line of sight to obtain the column density. Second, 
the potential is obtained by convolving the column density with the 2D Green's function. 
In the case of lensing by dark matter caustics, this procedure can be simplified: the shift is 
expressed directly as an integral over the parameter space of the dark matter flow forming 
the caustic. The relevant result is given in Eqs. 347 and 349 if the caustic has contrast in 
the two dimensions transverse to the line of sight, and in Eq. 354 if the caustic has contrast 
in only one of the dimensions transverse to the line of sight. We applied our formalism to 
the gravitational lensing by dark matter caustics in four specific cases. In the first three, 
the line of sight is tangent to a surface where a simple fold catastrophe is located. The 
three cases are distinguished by the curvature of the caustic surface at the tangent point 
in the direction of the line of sight: (1) the surface curves toward the side with two extra 
flows; (2) the surface curves away from the side with two extra flows; and (3) the surface 
has zero curvature. In the fourth case (4) studied, the line of sight is at a specific location 
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close to a cusp catastrophe and parallel to the plane of the cusp. We found that each case 
has characteristic lensing signatures. In three of the cases (2, 3 and 4) there are multiple 
images and infinite magnification of point sources when their images merge. In case 1, 
there are no multiple images. Unfortunately, the effects are small even for dark matter 
caustic lenses at cosmological distances. The multiple images of a point source cannot 
be resolved with present instruments. Typical magnifications and image distortions are of 
order one % to a few %. A promising approach may be to observe the distortions caused by 
dark matter caustics in the images of extended sources, such as radio jets. This possiblity 
was discussed in case (1). 
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A Vacua of Non-Abelian Gauge Theory and Instan- 
tons 

A.l Description of Non-Abelian Gauge Theory 

Let the Lagrangian density be £(\I/(x), d^(x)), where ty(x) denotes an arbitrary field as a 
function of spacetime coordinate 3 ). We adopt the convention where the 

spacetime metric tensor has the components = diag(+l, — 1, — 1, — 1). Under a local 
(i.e., x-dependent) gauge transformation 

tf(ar) ^y'(x) = U(x)^(x) , 
d^(x) -> (d^(x))' = U{x)d^{x) + (d^U(x))^(x) , (393) 

where U (x) is an element of a Lie Group G. A Lie group is a continuous group generated by 
the corresponding Lie algebra which is a vector space whose basis consists of N generators 
T a (a = 1, . . .,N) that are closed under commutation: [T a ,T b ] = if abc T c . The structure 
constants f abc are real numbers and they completely define the Lie algebra. We adopt 
the normalization conditions Tv(T a T b ) = \5 ab . An element U(x) of G is obtained by 
exponentiating the algebra: 

U{x) = e-^WT* . (394) 

Because U is parametrized by {uj a }, the group manifold of G is the space of the possible 
values of {w a }- The correspondence between Lie group and Lie algebra is many-to-one. In 
order to have a one-to-one correspondence between {oJ a } and a group element, the possible 
values of {uo a } must be restricted. 

Gauge invariance, at the Lagrangian level, means 

£(*(*)', (d^(x))') = C(*(x), d^(x)) . (395) 

This requires invariance under global (i.e., ^-independent) transformations. However, since 
d l JJ(x) 7^ 0, in Eq. 393, d^(x) does not transform in the same way as ^>(x). This spoils 
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the gauge invariance of C, even if £ is invariant under global transformations. To preserve 
the invariance we look for a covariant derivative with the transformation property 

(D^(x))' = D'^'(x) = U(x)(D^(x)). Equivalently, since D'^'(x) = D'^U{x)^, we 
want D' fl U(x)^/ = U(x)(D fl ^/(x)) which implies 



D'U(x) = U(x)D, , 



(396) 



or 

D'^U(x)D,U(x)- 1 . (397) 

To find D^, we introduce a Lie Algebra valued vector field A^x) = A®(x)T a , and try the 
ansatz 

D^{x) = [dp - igA^(x)}^(x) . (398) 
If we demand A^x) to transform as 



A'^x) = U{x)A IJL {x)U- l {x) - -(dpU(x))TJ-\x) , 



(399) 



the ansatz Eq. 398 gives 



(D^(x)Y = D'^'{x) = {d,-igA'^x))U{x)y{x) 
= (d fl U(x))^ + U(x)d^(x) 
-ig 



U(x)A lx (x)U- 1 (x) - -(d^U(x))U-\x) 



U{x)^{x) , 

which yields the required transformation property Eq. 396 for D^(x): 



(400) 



(D^(x))' = U(x)[d, - igA^Mx) = U(x)D^(x) . 



(401) 



Suppose the gauge field is the vacuum configuration; A^ = 0. If we perform a spacetime 
dependent gauge transformation Eq. 399, we obtain 



Ap, - A,{x) = --(d^Uix^U-^x) . 



(402) 
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The form of A^{x) in Eq. 402 is called "pure gauge. 
For an infinitesimal change 



U kl = 5 kl -iT^u a + 0{uj 2 ), 



(403) 



Eq. 399 becomes 




T^A; + iu; b [T a ,T b ] M A; 




a 



(404) 



Using the commutation relation [T a ,T b ] k i = if abc T^ h we find 

SAVx) = f abc co b (x)A c (x) - -d^ a {x) . 



(405) 



To make the gauge fields A ^ dynamical variables of the theory we need to add their 
free Lagrangian density, which should satisfy gauge and Lorentz invariance and should be 
quadratic in spacetime derivatives of A®. Thus, we need the generalization of F^F^ of 
the Abelian case (Maxwell's theory) where the field strength tensor is F^ = d^A p — d v A^. 
We may try to define a field strength tensor of the same form d^A^ — d u A^, however, 
this transforms in a complicated way under Eq. 405. Recall that F^ v is the curvature 
(commutator of the covariant derivative) associated with the covariant derivative = 
— ieA^, hence F^ u = - [D^,D U ], where e is the electric charge. Using this analogy, we 



find 




igG^(x)V(x) 



zgT a G%{x)^{x) , 



(406) 



where 



G^(x) = d,Al(x) - d„A;(x) + gr bc A%x)Al(x) . 



(407) 
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Let us check that the above form of the field strength tensor transforms in a simple way, 
determined only by f abc . Because of the transformation property Eq. 397 of the covariant 
derivative , we have 

([D^,D u ]^(x)J = U(u(x)){[D„D^(x)) . (408) 
Using Eq. 406 in Eq. 408, we find that T a G'^U{u)m = U(u)T a G a ^, or 

T a G'« u (x) = U{uj{x))(T a G<; v {x))u-\uj{x)) . (409) 
Then, the infinitesimal transformation is calculated by inserting Eq. 403 into Eq. 409: 

5G%(x) = r bc u\x)Gl u (x) = i[G, v {x)Mx)\ , (410) 

as required. 

Up to here, we have been acting the covariant derivative on fields \& which trans- 
form as = —i^ a T^i. Now, we want to apply the covariant derivative to Lie Algebra 
valued objects V = V a T a . Before proving the form how the covariant derivative acts on 
the Lie Algebra valued objects, it is useful to introduce a generalized cross product for 
gauge group vectors V = V a T a , W = W a T a and X = X a T a , where the dummy index 
a — 1, 2, . . ., n: 

V{x) = (V\x), . . ., V n {x)) , W{x) = (W\x), . . ., W n {x)) , X{x) = (X\x), . . .,X n (x)) . 

(411) 

We define 

(V(x) x W(x)) a = f abc V b (x)W c (x) , (412) 

which implies 

(V(x) x W{x)) a = f abc V b {x)W c {x) = -i{[T b ,T c ]) a V b {x)W c {x) 

= -i([V(x),W(x)]) a . (413) 
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Consequently, we can make use of the usual cross product relations like 

VxW = -WxV, (414) 

V x (W x X) + W x (X x V) +X x (V x W) = 0, (415) 

where the later is known as the Jacobi identity. In this new notation Eqs. 405 and 410 
can be written, respectively, as 

SA^x) = u(x) x A^x) - ^d p £u(x) , (416) 

SG^(x) = uj(x) x G^(x) (417) 
Now, we want to show that the covariant derivative of V = V a T a must be of the form 

D II V = d fl V-tg[A fl ,V], (418) 

to have the transformation property (D p V)' = U(D fl V)U^ 1 . In other words, expressing 
the covariant derivative as D^V = T a {D^V) a and D^V = {{D^Vf : a = 1, . . ., n}, we 
will show that 

D;V = d fl V + gA II xV, (419) 
has the required transformation property, in infinitesimal form, 

5(D;V) = Cux (D^V) = Cdx (d„V + gAxV), (420) 

where 

SV = cJxV . (421) 

In particular, when V is taken as the field strength tensor, we will have D p G up = d^G up — 
ig[An, G vp ], where 5G^ U = cu x G^ as in Eq. 417. Now, to prove the claim Eq. 419 let us 
calculate the transformation 

S(D^V) = 5(d p V + gA p x V) = d^V + gSA^ xV + gA p x5V. (422) 
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Using Eqs. 416 and 421 in Eq. 422 we obtain 

5(d;v) = 9,(5 x - (3,5) x 7 + ff (dl x 4) x 7 + jl, x (w x 7) 

= uj x d^V + g ((u x Afj) x7 + I^x (wx vf) . (423) 

Finally, the application of identities 415 in Eq. 423 yield 

5(D^V) = iuxd^V + gCux(A^xV)=Cux (d^V + gA^ x V) , (424) 

which is the transformation we wanted in Eq. 420. Thus, we define = <9 M — i<?L4,, ] or 
Dp = 8^ + g(A^x ). Another relation we want to derive is 

[D^D V ]V = d^V + gA, x V) + gA^{d v V + gA^ x V) - ~ v) 

= g(d^A v - d v Ay) x V + g\A^ x (A u x V) - A v x (1, x V)) 
= g(d^A u - d v A^ + gAp xA v )xV 

= gG^xV, (425) 

where we have used the identities 415. Using result 413 in Eq. 425 we obtain 

[D^,D U ]V = -ig[G, u ,V]. (426) 

All the pure gauge types of vector potential, defined in 402, are vacuum solutions. 
This can be shown as follows. For the pure gauge fields 

G^(x) = {d^A v {x)-igA^x)A v {x))-(^^u) = ~[[d li (U(x)d v U- 1 {x)) 

+U(x) {dJJ-^xj) U(x) (d v U-\x)) (427) 

On the other hand, by taking the derivative of the identity U(x)U~ 1 (x) = /, we obtain 
(d^U)U- 1 + Ud^U- 1 = which gives 

a^Uix) = -U(x) (dpU-^x)) U(x) . (428) 
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Replacing d v U (x) in Eq. 427 using Eq. 428 one finds 

G, v {x) = . (429) 

Recall Eq. 410 that, unlike the Abelian case where f abc = 0, G^ v transforms 
nontrivially (under gauge transformation Eq. 399 the field tensor is gauge covariant in 
the non- Abelian case whereas gauge invariant in the Abelian case). However the trace 
Tr (G^G^) = \G1 u G a ^ v is gauge invariant: 

8[G%{x)G*"{x)\ = 25Gl v {x)G a ^{x) = 2f abc u; b (x)G c fll/ (x)G a ^(x) = . (430) 

This is because the indices a and c are antisymmetric in f abc whereas symmetric in G c ^ u G a ^ u . 
Therefore, 

£ gaugc = - l -TvG, u {x)G^{x) = -\g« u (x)G^(x) , (431) 

can be used as the kinetic term for the gauge vector field A a ^(x). This is, in fact, the 
Yang Mills Lagrangian density and in our the discussion we consider only the Yang Mills 
field theory. The equations of motion are obtained from the condition that the action 
S = f <£c 4 £ gauge be stationary against arbitrary variations of the gauge fields. Suppressing 
the explicit spacetime dependence of the fields, 

5S = - J d 4 xTr(G^5G< lu ) , (432) 

where SO™ = (d^5A u - igSA^A" - igA^5A u ) - (fj, <-> v). Thus, using G^ = -G Vfi , we 
obtain 

5S = -2 J d 4 xTr [G lxv {d>"8A v - ig5A"A v - igA^SA")] . (433) 
Integration of the first term by parts yields 

5S = 2 J dx 4 Tr [VG^ + igG^ (5A»A U + A"6A V )] . (434) 
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Since Tr [G^SA^A U ] = -Tr [A»G pu 5A v ] we have 



8S = 2 J dx^TrWG^-igiA^G^SA" 



Thus the field equations are 



(435) 



D*G IH , = d> l G IH ,-ig[Ai l ,G IH ,\ = Q, 



(436) 



or equivalently, 



Using the Jacobi identity, 



D^G% = d»G a uv + gf abc A b »G c uv = 



jJLU 



(437) 



[Dp[D v , D p \] + [D V [D P , Dn\\ + [D p [D^ D u ]] = , 
and Eq. 406, [D^, D u \ = —igG pu we obtain the Bianchi identity: 



(438) 



[D p , G vp \ + [D v , G pp ] + [D p , G^\ = . 



(439) 



Let's calculate the above commutators: 



<9 P - ig[A p ,], G, P J = G^]* - ig \[A^ ], G„„ 
dfj,{G vp ^) - G up d^ - ig([A p G vp ^} - G up [A^, *]) 
(fyG„„ - iglA^, G vp \) * = (D^Gvp)^ . 



(440) 



Likewise [D^, G pp: ] = D v G PPi and [£> p , G piy ] = D p G pu . Hence the Bianchi identity given in 
Eq. 439 reduces to 

D p G vp + D u G Pfl + D p G, v = . (441) 
By introducing the dual field strength tensor 



= -e^ pa G pa 



(442) 
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we can also write the Bianchi identity as 

D*G^ V = . (443) 

Here we immediately observe that the (anti)self dual field configurations, where G^ u = 
=F*G M i, respectively, automatically satisfy the field equations D^G^ = due to the Bianchi 
identity This result is appealing because it is easier to solve the first order self duality 
equations *G iiV = G^ v rather than second order field equations D^G^ = 0. 

The symmetric stress energy tensor is obtained easily by coupling the theory to a 
background metric g^: 



L = -\G^G a pa g^V^ , (444) 



and varying the action with respect to g^ u : 

V-g \09nu 09nu, P og^paj 
Using the identities 5g = gg^Sg^ and dg pa /dg^ = -(g^g ua + gP u g^)/2 we find 

We define the electric field E = E a T a and the magnetic field B = B a T a , which are elements 
of the Lie Algebra: 

j^ai G a ^ 

B ai — _\ t i 3 k Q a i k ^ Qaij _ _ e ijkj^ak (447) 



In terms of = (A , A) 



E i = ^A -d°A l + ig[A°,A l ], (448) 
B i = - e ^\d 3 A k -igA j A k ) . (449) 
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These matrices are not gauge invariant, they transform according to Eq. 410. The equa- 
tions for non-Abelian electric and magnetic fields can also be written as: 

f)A a -H 

E a = -VA a0 - — + gf abc A b A c \ (450) 

B a = V x A a - 9 -f ahc $ x A c . (451) 
2 

The above equation implies that non-Abelian gauge theories allow magnetic monopoles 
with magnetic charge density: 

p = V • B a = -?-f abc V ■ (I 6 x A c ) . (452) 
2 

Total magnetic charge is the surface integral at spatial infinity and there are monopole 
solutions for which the integral does not vanish. 

The components of the dual field tensor 442 are 

* QaOi ^J-jk Qajk B a ^ 

2 ~ 

*^iaij ^ijkQdOk ^ijkjjjak (453) 

Duality replaces E m — > B m and B m — > —E a% . The Lagrangian density and energy can be 
computed in terms of E m and B a% : 

£ gaU gc = ~{-2G a0i G a0i + G aij G aij ) = -{E a -E a -B a ■ B a ) , (454) 
4 2 

T oo = G a0i G a0i -g 00 C=^{E a -E a + B a -B a )=S , (455) 
where 8 denotes the energy density. Using Eq. 453 we find 

*G a ^G* u = -4E a ■ B a . (456) 
The equations of motion 436, in three vector form, reads 

VxB a = — - + gf abc (A b E c + A b xB c ) , (457) 

C6 

V-£ a = gj abc $-E c . (458) 

They are the generalizations of Maxwell Equations. 
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A. 2 Classical Instanton Solutions of Non-Abelian Gauge Theo- 
ries 

A pure gauge theory can have "static" soliton solutions only in four dimensional Euclidean 
space [60] (here static means that the solutions are independent of one of the four co- 
ordinates). In this section, we will study such a solution, called the instanton, which is 
characterized by a topological charge and finite actional, 62, 63, 64, 65]. Instantons are 
important for the quantum theory, because they describe tunneling between degenerate 
vacua. Instanton solutions play a crucial role in understanding the quantum theory of 
gauge theories. We will use most of the results obtained in this section when we discuss 
the nontrivial vacuum structure of non Abelian gauge theories, later in this Appendix. 

We will concentrate on the Euclidean SU (2) gauge theory which is obtained by making 
the following replacements in the Minkowski space version of the theory: 

x° -> -ix A , 9fiu = (+, -, -, -) -> = (+, +, +, +) , (459) 

where the indices run from 1 to 4 in Euclidean space and e 1234 = 1 (e^ pCT is a tensor density; 
it has to be multiplied by the determinant of the transformation when it is transformed. 
That is why it does not pick up an i under x° — > —ix A ). Hence do — > id^, D — > iD^, 
and A (x) — > iA±(x E ). Note that A and d are components of vectors hence transform 
similarly, whereas x° is a component of a co-vector and transforms oppositely. Then the 
fields transform as 

Ef^iEf, Bf^Bf. (460) 

Defining x 4 = T (hence t = x° — > — ix A = —iT), where T is a real parameter, the 
transformation of the action is 

iS M = ijd 3 xdtC M (t, x) -*ijd 3 x(-idT)£ M (-iT, x) = -Jd 4 x E C E (x E ) = -S E , (461) 

where we define C E (x E ) = —£M{—iT,x). Continuation to Euclidean spacetime means 
that we consider the dynamical evolution of the system in imaginary time. In principle, we 
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must solve the equations of motion in which the time x° is replaced by — ix A . The Lorentz 
invariance 5*0(3, 1) of the Lagrangian density is replaced by invariance with respect to 
SO (A) rotations in Euclidean space. The equations of motion determines how the fields 
are to be continued into Euclidean space. For example, for a real scalar field 4>(x), the 
action in Minkowski spacetime is 



S 



M 



dtd x 



- {(d <f>(x M )) 2 - V 2 0(x M )) - U(<j)(xM)) 



(462) 



where U is the potential energy. Let's consider the equations of motion 



d 2 <j)(xM) - d 2 <j)(x M ) + U'((/>(x M )) = , 



(463) 



where prime denotes the derivative with respect to the field. If we Euclideanize the space- 
time so that do — > id& and <9j — > di, we obtain 



-dl<p{x E ) - d 2 <p{x E ) + u'Mxe)) = o . 

To have the required SO (4) invariance we must have 

4>(xm) = 4>{x° = t, x ) -> 4>{x E ) = 4>{x 4 = T, x) , 



(464) 



(465) 



U{<t>{x M )) -> U(<j>(x E )) = -U(<j>{x E )) . (466) 

Therefore, a real scalar field <fi(x) defined in Minkowski space is replaced by a real scalar 
field <f>(x) invariant under 5*0(4) and the potential energy picks up a minus sign. In 
Euclidean space Eq. 463 is replaced by 



-d 2 E 4> E - U'{4>{x E )) = . 
The action in Eq. 462 therefore becomes 

S M - J (-idx 4 )d 3 x [i ( (id xi <P(xE) f - V 2 0(x s )) + U(<j>(x E )) 



(467) 



% \ d A x E 



^{d E <t>{x E )) 2 -U{<t>{x E )) 



= iS E , 



(468) 
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which confirms Eq. 461. The relation between the path integral formulation of the two 
regimes is therefore 

J [dA]e tSM -> J [dA}e- SE . (469) 

In the rest of the section, we will drop the index E and adopt the summation convention 
when the repeated indices are both lower or upper. Using Eqs. 454 and 460 in Eq. 469, 
we find that the Lagrangian in Euclidean space reads 

C = \g%G% = \{EtEt + BfB?) . (470) 

Non Abelian electric and magnetic fields are assigned to the strength tensor exactly the 
same way as in Minkowski spacetime: = —Ef, and Gfj = —e^Bf.. These definitions 
are consistently yield the above action. The dual field strength tensor *G a jXV = ^^paG 0, 
has the following components: 

•G£ = S?, *G? j = e ijk E a k . (471) 

Duality replaces Ef <-> —Bf, therefore self dual and anti-self dual field configurations 
satisfy 

G% = ±*G«„ & El = T B? . (472) 

Due to the Bianchi identity (Eq. 443), selfdual and anti-selfdual solutions automatically 
satisfy the equations of motion D^G a = 0. Hence, it is sufficient to find gauge potentials 
yielding Ef = TBf to find a solution for the equations of motion. 

In Euclidean space the stress energy tensor T pu = G® a G® a — \g p,uG a pa G a pa has the 
following components: 

T 44 = X -{EfEf-BfBf), 

Tij = EfEj — BfB" — ^5ij(E%E% — B^B^) , 

T 4i = e ijk E«B a k . (473) 
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All of the above components vanish for the self dual or anti self dual fields (as Ef — > TB" 1 ). 
Thus, in Euclidean space, the energy momentum tensor for (anti)self dual solutions is 
identical to zero. 

The other 5*0(4) (S0(3, 1) in Minkowski spacetime) and gauge invariant quantity is 
given by the pseudoscalar density: 

T = ^Tr GV*GV = -E?B? . (474) 

This term is not considered as a kinetic term because it is a pure divergence, it can be 
written as 

T ee d p K, ee ±- 2 d, J p , (475) 

with 

K p ee e^ pa Tr[A u d p A a - i^A v A p A a ] . (476) 

(Here, let us note that for the group SU(2), one can easily evaluate the traces by introducing 
Pauli spin matrices {a a ;a = 1,2, 3} as the generators T a = |a a , hence A p = A a ^a a . The 
identity a a a b = 5 ab + ie abc a c yields Tr(a a a b ) = 25 ab , Tr(a a a b a c ) = 2ie abc and one obtains 
K — -e \ A a F) A a -I- 9p ahc A a A b A c ~\ \ 

Let us prove Eq. 475: 

T = ^ upa Tr { [(d p A u - igA p A v ) - (p v)\ [{d p A a - igA p A a ) - (p <-> a)] } 
= e^ pa Tr [(d p A v )(d p A a ) - ig{d p A v )A p A c - igA p A v d p A a - g 2 A p A v A p A a ] . 

Using the cyclic property of trace, the last term in the above equation can be eliminated. 
Renaming p <-> p and v <-> a in the third term, we obtain 

T = e^Tr [(d^A u )(d p A a ) - 2ig(d p A u )A p A a ] . (477) 

It is trivial to show, again using the cyclic property and renaming the necessary indices, 
that 

W Tr [(d p A u )A p A a ] = ^ M Tr \A v A p A c \ . (478) 
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Moreover, 

W- Tr [(^A u )(d p A a )} = e^dpTr [A v d p A a ] , (479) 

because e pvpcT Tr [A v dpdpA a ] = due to the symmetry of the partial derivatives and the 
anti symmetry of the Levi Civita tensor density. Thus we have 

= dpKp , (480) 

as claimed. 

Integrating T over all four space we obtain the topological charge (Pontryagin index) 
q[A) = (g 2 /87i 2 ) j d 4 xT of the Euclidean field configuration A a (x)p\ 

= ^2 / rfS A + *in g = i dY,pJp + g sing , (482) 

where the surface integral is on the boundary of four space, S* 3 , with the measure oE, 
and g sing is included because of the possible solutions that might contribute to the integral 
independent of their behavior at the boundary (e.g., Dirac-delta like solutions or solutions 
which yield divergent volume integral Eq. 481 because of their singularities away from the 
boundary). For nonsingular solutions g sing = 0. Let us note that q[A] is gauge invariant: 

4 = yf^ / d'xTriG^G^} = yf^ / d^UG^U^U ^U' 1 ] = Q • (483) 

The importance of the nonsingular solutions becomes clear in quantization of gauge fields 
via path integral methods where the integration is over all classical field configurations 
A a {x) in imaginary time that interpolate between the vacua at xq = ±oo. The dominant 
contributions to this integration comes from the field configurations for which S^LA™] is 
stationary. Here lies the particular relevance of the (anti) self dual field configurations which 
satisfy G pL , = T*G a pu . They are solutions of the classical field equations DpGp V = due to 
the Bianchi identity Dp'Gp V = which is always satisfied. 
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.2*7 



T — dp ep U p a Ti( y A u dpA . i—A u A p A a ) 



Before trying to find solutions of the field equations, let us show that the action for 
arbitrary Euclidean solutions is bounded below by |(87r 2 /<yf 2 )g[y4^]|. In Euclidean space we 
have 

n[(G> T *G, U ){G, U =f *G, V )\ > , (484) 
since it is the sum of squares. The above equation implies 

Tr[G^G^ + *G flu *G flu ] > ±2Tr[G^*G^} . (485) 

In Euclidean space e^^e^e = 2[S pi p5 a Q -5^5^], thus Tr[*G^ u *G flu ] = Tr[G^ u G^]. Then, 
Eq. 485 gives 

Tr[G^G^] > ±Tr[G^ *G^\ . (486) 

This means for the action 

S E = \j d^xTrlG^G^] >± l -J d A xTi[G, u *G, u ] . (487) 

However, since TrfG^G^] > 0, the action S E > 0. Therefore we obtain the result we 
want 

S E > 



l - J d*xTr[G, u *G, 



8vr 2 

' " \q[A]\ • (488) 
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The lower bound is reached (i.e., the equality is satisfied), when G^ u = ±*G^ U . Thus 
the action has its minimum value S E = (87r 2 /g 2 )|g[A]| for (anti)self dual solutions. We 
had already encountered another property of these solutions. They also had zero energy 
momentum tensor (Eq. 473). We are now ready for the topological interpretation of q[A]. 
Recalling Eqs. 482 and 485, for nonsingular field configurations, the Euclidean action: 

S E = \ [ d'xTxlG^G^} > %\q[A]\ = ^- 2 l d^J, . (489) 
1 J 9 z 9 Js^ 

Thus the minimum value of the action depends on the behavior of the gauge fields at 
infinity. For S E to be finite, G^ v has to decrease sufficiently fast to zero at Euclidean 
infinity: 

lim G^(x) = . (490) 
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Therefore, the potential must go to a pure gauge configuration at infinity: 

A„ -> -^(d^U)U- 1 for x 2 -> oo , (491) 

which is obtained from 7L M = by a "gauge transformation." Here the term gauge transfor- 
mation must be used with precaution. There are so called "large gauge transformations" 
that yield physically distinct vacua which are gauge inequivalent to the vacuum = 0. 
We will discuss this issue, in detail, later in this Appendix. The minimum value of the 
action is reached by asymptotically pure gauge configurations that yield asymptotically 
vanishing fields (Eq. 429). The boundary condition Eq. 491 defines a map at infinity. At 
the infinity of four dimensional Euclidean space E 4 we have boundary S^. Equation 491 
associates an SU(2) gauge group element U (x) with each point on the boundary 3-sphere 
S 3 ^ of E A . Hence any solution with the behavior of Eq. 491 defines a map: 

x -> U(x) : Si -> SU{2) . (492) 

In fact, since the 577(2) group manifold is topologically the same as S 3 , the solution 
with condition Eq. 491 determines a map 

Sl^S 3 . (493) 

All maps of the S* 3 onto itself are decomposed into homotopy classes by an index n G Z, 
called winding number. The integer n is simply the number of times S 3 gets covered by 
the map from S^. The trivial vacuum gauge field A^ = maps all the points of to a 
single point of the gauge group space S* 3 (which can be identified as the (minus) identity of 
the group). The trivial map has zero winding number since it does not cover the S* 3 . 

A. 2.1 The Winding Number 

Now, we will show that for gauge fields A^(x) with the asymptotic behavior of Eq. 491, 
the winding number n is given by [61, 62, 63] 

q[A] = 32^ / d ' xGa ^ G ^ = n ■ ( 494 ) 
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Let us start by calculating the current J M for a pure gauge Ap = —(i/g)(d tl U)U x : 



; 2g 

3 



AvdpAu — i—A u ApA a 



—If Tr 



(d v u)u- l d p l(d a u)u- 1 ] - ^(dMp-^dpUp-^d^u- 1 



Evaluating the term 

dp l(d a U)U- 1 } = (d p d a U)U- l + (d a U)dpU- 1 = (d p d a U)U- 1 -(d a U)U-\dpU)U- 1 , (495) 

where we used identity Eq. 428 to write dpU^ 1 = — ?7^ 1 ((9p?7)f/~ 1 , and inserting back in 
Eq. 495 we find 



T — —If Tr 



id v u)u-\d a u)u-\d p u)u- x - fauW-^d^u-^dMW- 1 



(496) 



Moreover, if we rename a <-> p in the first term above and sum up the terms, then 



Jp = ~ep U p a Tr[(d u U)U-\dpU)U- 1 (d C7 U)U- 1 } . (497) 
Thus, for nonsingular solutions satisfying Eq. 491 

q[A] = lk* = £^W Tr l(d u U)U-\dpU)U-\d a U)U- 1 ] , (498) 

depends only on the group element U(x). It is interesting that the minimum value of an 
arbitrary Euclidean action depends only on the properties of U(x) at infinity, and not on 
the details of the field configurations at finite x. The boundary, is parametrized by 
three angles ipi(x), fafa), fafe), while the group elements U are characterized by three 
angles O^x), 6 2 (x), 6 3 (x): U(x) = U(6 a (x)). Then, 

^ 5 rsr<w. <*»> 

Hence, 

q[A] = ^ / dS M e^(9A)(9A)(^0c)1t [(daUp' 1 (d^U' 1 (dMp' 1 ] . (500) 
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Since e llvpa is cyclic in (upa) and Tr [(d a U)U 1 (d b U)U 1 (d c U)U x ] is cyclic in (abc), we 
obtain 

q[A] = ^ £ 3 dE^ CT (dA)(d,0 2 )(d CT 3 )Tr [{d^U' 1 {d 2 U)U~ 1 {d^U' 1 ] . (501) 
The measure transform between the coordinate systems as 

d9 x d9 2 d9 3 = (Jacobian)#i#2# 3 d9 x d9 2 d9 3 = dL ll e l]H , piT (d v x )(d p 2 )(d a 3 ) . (502) 
Therefore, we can write q[A] in the following form: 

q [A] = ^\n I d9 1 d9 2 d9 3 TY[(d 1 U)U- 1 (d 2 U)U- 1 (d 3 U)U- 1 ] . (503) 

47T / q3 

Now, here the integration is over the group manifold of the gauge group Sg V ^ instead 
of the boundary manifold S^. The integer n G Z is there because it denotes how many 
points of is mapped to one point of Sg U( ^ 2 y Choosing for 9 a , the Euler angles, we can 
always parametrize an SU (2) group element 

U(9 a ) = e ¥^s e ¥^ e ie 3 a 3 ^ u-i(Q a } = e -^ e -¥^ e -¥^ , ( 50 4) 
where < 9± < 2n, < 9 2 < ir, < 9$ < Air. Then, after trivial cancellations, we find 



U = Tr [(9iC/)C/- 1 (9 2 C/)C/" 1 (9 3 C/)C/- 1 ] 

(505) 



= -*Tr 
8 



a 3 e^ dir73 a 2 e^ 2a2 (T 3 e-i e2(T2 e-^ eia[i 



In Eq. 505, we can rewrite the term / = e^ d2a2 a 3 e 2 6 ' 20 " 2 ; by power expanding the expo- 
nentials, as 

T = a 3 + C-9 2 )[a 2 ,a 3 ] + [<x 2 , [<r 2 , <x 3 ]] + j^ 2 f [a 2 , [a 2 , [a 2 ,a 3 }}} + ... 

= a\l-^9l + ...)-a 1 {9 2 -^9l + ...) 

= a 3 cos (9 2 ) — o\ sin (9 2 ) . (506) 
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Then Eq. 505 becomes 



U 



--Tr 



— Tr 

8 



a 3 ei 6lfT3 a 2 (a 3 cos (9 2 ) - a x sin (9 2 )) e"^ 3 
e^ lCT3 (-i(7i) ((7 3 cos (0 2 ) - (7i sin (0 2 )) e'5 e i-3 
-i cos (9 2 )ei eia3 a 2 e~i eir73 + I sin (0 2 ) 



Next, we use the identity 



a 2 cos (0i) + o"! sin (9i) 



(507) 



which can be verified the same way as in Eq. 506. Thus 

U = — -Tr [— i cos (0 2 ) cos (0i)(7 2 — i cos (9 2 ) sin (0i)(7i + / sin (0 2 )] 
8 

= -1 sin (0 2 )Tr [I} = -\ sin (ft,). 
Finally, we can calculate q[A\ using Eqs. 503 and 508: 

i r2iv pAn pit i 

q[A} = -—n / d9 l / eZ0 3 / d0 2 (--) sin (0 2 ) = n . 
47r io 4 

Equation 509 can be expressed in Minkowski spacetime. Since (d 4L x) E = id 4 x; G" 4 
and e 1230 = —1 (whereas e 1234 = 1), we have 

-.2 



q M [A] = 



9 



327T 2 



(508) 
(509) 

(510) 



The winding number n is a measure of how many times the group manifold SU(2) is 
covered by the map U(9(x)), when x spans the whole boundary of the four dimensional 
Euclidean space, 5^-,, once. 

We now consider the simplest nontrivial case, n — 1, the instanton. Instantons 
are localized, nonsingular self dual solutions of the classical Euclidean Yang-Mills field 
equations D^G^ U = with one unit of topological charge. They have vanishing Euclidean 
energy momentum tensor: T^ v = 0. In particular the Hamiltonian density H = T 00 = 0. 
For instantons the associated action is 



S E = 



8tt 2 



\q[A}\ = 



8tt 2 



9 9 

Having defined the instantons, we construct an explicit solution in the next section. 



(511) 
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A. 2. 2 An Explicit Instanton Solution by Construction 

We start with making some useful definitions [61, 64]: 



0> = (cr a , ±i), a* = (a a , =p), = 1(5^ - a^a v ) ,* "a ^ = -e^ p( pa p( p = ±0^ , (512) 

where \i and v runs from one to four, whereas a runs from one to three. Notice that is 
self dual if er 4 = +i, or anti self dual if 04 = —%. Let's continue with the following ansatz: 



U = 



x ■ a =f 1x4 



x 



jj— 1 n X • (7 i 2X4 



• ' 1 



1 



so that 



^purc _ \J)U~ X = - ^ M ^M^^lj _ 1 



(513) 



(514) 



X z (? x^ 

where we have used the identity (x^cr^) (x„c4) = x 2 . For a finite energy solution we need 
an A p which is not a pure gauge over the whole volume. We make the following ansatz for 
the potential 

1 r? r 

(515) 



1 On V X v £ , 2\ 

A,. = — H^/(x ) . 



m 2 
# x 2 



We require /(0) = for regularity at x = 0, and /(oo) = 1 for finite energy. To make this 
a solution, we only need to choose / such that the field strength tensor is self(anti-self) 
dual: G pv = ±*G pv . From Eq. 515 we obtain 



G 



2 (7(1-/) 



5-1 

2 17(1-/), 



0> + 



X^ 



pv 



9 



x 



2 a > xv 



f _ /(I " /) 



X s 



^ ixv p<f)\P ' , 'y*£ '(f> (J ^X^X pj /> , 



(516) 



(517) 



where prime denotes derivative with respect to the argument. Thus, G pu = ±*G pu , if the 
second term vanishes 

„ /(I " /) 



/' 



0. 



x z 



The general solution satisfying the required boundary conditions is 

„2 



cx 



CX' 



+ A 2 



(518) 



(519) 
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Note that as x 2 — > oo, /(a; 2 ) — > 1, hence — > ^4P urc as required. The solution with 
topological charge q = =Fl, has c = 1, as we now show. Setting c = 1, we find 



^ = ~gx 2 + \ 2 
We can write the vector potential more explicitly: 



(520) 



1 (x x a) ± £417 
5 



x 2 + A 2 



2 {tij a Xj dz <5 a jX4) 



.X" 



+ A 2 



(7 x 2 + A 2 
The field strength tensor becomes 



1 a ■ x .„ 2 x° 



G 



±*G 



2/(W) 

± ^ (T 



9 



from which we obtain 



Ef = a- 



±- 



A 2 



± 2 - 



(521) 
(522) 



A 2 



^ (x 2 + A 2 ) 



(7, 



;5? 



! ^(x 2 + A 2 ) 2 " 1 



(523) 



(524) 



Let us confirm that the topological charge q[A] = =Fl. For self (anti-self) dual fields: 
Ef = ±B?. Then, since (l/4)G%*G« u = -E?B?, we have: (1/4)G%*G% = TE?E?. 
Hence, 



8tt 2 



1 R\4 



d X- 



1 



x 2 + A 2 ) 4 ' 



(525) 



In four dimensional Euclidean space d A x = dfl 3 r 3 dr = n 2 r 2 d(r 2 ), thus 



q[A] = T6A 



°°r 2 d(r 2 



o (r 2 + A 2 ) 4 



T-6 



zdz 



1 



3(1 + z) 



2(1 + z) 



= Tl, 



/o(l + ^) 4 

where we changed the integration variable r 2 = \ 2 z. Later we will show how instantons 
are related to tunneling events between physically distinct gauge field vacua. But, before 
that, we want to study the quantum theory of non Abelian gauge fields in the next section. 
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A. 3 Canonical Formalism and Quantization 

Let us turn back and formulate the classical gauge theory using the canonical procedure [61, 
66]. The Hamiltonian density and Hamiltonian are obtained using the standard recipe: 

H = U^d A^ - £ gauge , (526) 

H = I d 3 xH , (527) 



where I1 M is the canonical conjugate of the gauge field. From Lagrange density Eq. 454, 
we find 

dC 

nSW = 5(SMS)= - (528) 

mx) = m&j = ~° ai >i = ~ Em ■ (529) 



Thus 



n = -E a -— (E a ■ E a - B a ■ B a ) . (530) 

dt 2 V 1 v 1 



Using Eq. 450 we can re-express the first term of Eq. 530 

-E a ■ — = E a ■ (E a + VA a0 - g f abc #A c0 ) 
dt 

= E a ■ E a + V{A a0 E a ) - A a0 (V ■ E a - gf abc A b ■ E c ) . (531) 

Then, Eq. 530 becomes 

H = - (E a ■ E a + B a ■ B a ) + V(A a0 E a ) - A a0 D ■ E a , (532) 
2 

where 

D ■ E a = V • E a - gf abc A^ ■ E c . (533) 

If we try to impose the canonical equal time commutators (Poisson bracket relations in 
the classical theory), we immediately encounter trouble since n = because C gauge does 
not depend on d A a0 . Since A a0 is not a dynamical variable, we may eliminate it from 
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the Hamiltonian by solving for it in terms of the dynamical variables through the classical 
constraint equation Eq. 458. This constraint equation (D ■ E a = 0) is nothing but the 
variation of the action SS given in Eq. 435 with respect to 5 A . The component A a0 may 
be expressed in terms of the dynamical variables by inserting E a given in Eq. 450 into the 
constraint Eq. 458: 



dA 



V 2 A ao _ j-abc^b . ( 2 VA c0 + g—- + g 2 f cpq A p0 A q ) + gf abc A m V ■ A c = . 



dt 



(534) 



The usual procedure however, is to use the gauge invariance and set A a0 to zero. In this 
case Hamiltonian is the energy with density Eq. 455: 

1 



H = 1 I d 3 x((E a ) 2 + (B b ) 2 



When the canonical commutation relations are imposed 



d 6 xT 



oo 



(535) 



[A*(2,t),Al b (*,t)] = 0, 
[E*(2,t),E? b tf,t)] = 0, 
[E ia {x , t), A^ix 1 , t)] = i5 lj 5 ab 5{x - x') , 



(536) 



Hamilton equations = i[H,A a ] and = i[H, E a ] yield, in the temporal gauge, the 
definition of the electric field 450 and Ampere's law Eq. 457 (spatial components of Eq. 
436) respectively. Let us verify these claims. The Hamilton equation for A a (x) yields 

— Or) = i[H, A a (x)} = -J d 3 y{ [(E b (y)) 2 , A\x)\ + [(B b (y)) 2 , A\x)\ } , (537) 

however, since 

B b (y),A a (x) 



= VyX 



4V„, 9 fbcd 

#(y),i a (x) 



V, x A\y) - y bcd A\y) x A d (y), A a (x)\ = [v w x ^(y),!^) 



, 



(538) 



using the distribution law and Eq. 536 we recover the definition of the electric field: 

r)A a i f r -> i 

-fi-W = g / d3 V [(E b (y)) 2 ,A«(x)\ = -E%x) . 



(539) 



140 



The Hamilton equation for E a i(x) yields 

f)E a i i f r -> i 

" x)=i[H,E*i(x)] = - d 3 y (B\y)) 2 ,E^(x) = i d 3 yB M (y)[B M (y), E a > (x)] , 



dt 



X 



where 



i J d 3 yB M (y)[e ipq d yP A bq (y) - ^f bcd e™A cp (y)A dq (y),E a h 



ie ipq J d 3 yB bi (y)d yP [A bq (y), E a] {x)] = e ipj J d 3 yB ai {y)d yP 5{x - y) 
J d 3 y {d yP (B m (y)5(x-y)) - (d yP B a \y)) 5(x - y)} 



(540) 



e rpi |y d 2 ZB ai (x) - d xP B a \x) \ = c J '"0. r ,B"'{.r) 

v x 5 a (x)y , 



and 

X b = - l -gf bcd e^J d 3 yB b \y) [A cp (y) A dq (y) , E a > (x)} 

= ~9 {f bca e ip] B bl {x)A cp {x) + f bad e ljq B bl {x)A dq {x)} , 
adding the two terms up we find 

X & = -gf abc x S c (x))' . 

Hence 

J a + l b = (V x £ a (z) - gf^A^ix) x S c (x))' , 

which implies 



dE a 



x ) = V x 5 a (x) - gf^A^ x 5 c (:r) = D x B a . 



(541) 

(542) 
(543) 

(544) 

(545) 
(546) 



However, the Gauss' law (the time component of Eq. 436), D ■ E a = 0, is not found. Note 
that B a (x), given by Eq. 451, is not a fundamental variable and its definition does not 

— * 

appear in the Hamilton equations either. B a (x) is given by Eq. 451. 
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In Eq. 399, we have seen that, classically, the theory has a symmetry which leaves 
the equations of motion invariant. We choose the gauge A°(x) = 0. But this does not 
completely fix the invariance. The infinitesimal form (Eq. 405) of the residual symmetry 
transformation is 

5A a {x) = -g-'Dcu^x) = -^-Voo a {x) + / abc ^(x)u;(f) . (547) 

Here, unlike Eq. 405, u(x) is time independent. The time dependent gauge transformations 
would generate nonzero A a0 (x) in view of Eq. 399. Quantum mechanically this residual 
gauge invariance is expressed through the operator equation [61, 66]: 

[D ■ E a (x), H] = i^-D ■ E a = W ■ E = . (548) 

To see that [D ■ E a {x), H] = 0, let us first simplify Eq. 548: 

[D ■ E a (x), H] = i{d t E? + gf abc A^ - gf abc A^) . (549) 

Using A b = —E b , we obtain 

[D ■ E a (x),H] = i&E* - gf abc A b E^) . (550) 

On the other hand, from the equations of motion {J) a ^F bvii = 0), we have 

D ab F biO _ q ( 551 ) 
D ab F bOi + pabpbji = _^ab _ gfabd^flb + jjab pbji = q _ 

In the temporal gauge, Eq. 552 becomes 

E? = DfF b 3l . (553) 

Then the commutator [D ■ E a (x), H] = iD ■ E can be expressed as 

[D ■ E a {x), H] = i^d^ - gf abc A\E'r) = iD ab E b = iDfD bc F^ 

= \{DfDf - D?D bc )F< t = D^F^ , (554) 
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where 



[A, Dj] ac = DfDf - DfD bc 

= {5 ah d l - gf^Afii^dj - gf bcd A d ) - (5 ab d 3 - gf^A^di - gf bcd A d ) 
= -gf acb (d t A b - djA b ) + g 2 r be f bcd (A*A d - A)A d ) 

= -gf acb {d l A b - djA\) + g 2 (f abe f bcd - f abd f bce )A'rA d . (555) 
Using the Jacobi identity, f abe f bcd - pMfbce = facbfbde^ we recagt Eq> ( 555 ) ag 

[D t ,D 3 r = -gf ac \d^ b j -d j A b i )+g 2 f acb f bde A':A d j 

= -gf^dA) ~ djA\ + f™A\A^ = -gf acb F b . (556) 

Hence, Eqs. (554) and (556) give 

[D ■ E a (x), H] = l -[D i: D,] ac F^ = - % -gf^F b 3 F c l3 = . (557) 

The commutator vanishes because the indices a and b are antisymmetric in f abc whereas 
they are symmetric in F^F^. 

Therefore, an eigenfunction *&[A] of H can be chosen as a simultaneous eigenfunction 

— * — * 

of D ■ E a {x). Thus, Gauss' law is imposed on the Hilbert space by choosing the eigenvalue 
of D ■ E a (x) to be zero 

D-E a (x)^[A(x)}= 0. (558) 

We are particularly interested in vacuum configurations. Classically the ground state must 
correspond to time independent minimum energy solution. We have, therefore, = 0. 
To prove this, consider the action for a time independent solution (i.e., the space integral 
of the Lagrangian density) [67]: 

S = Si - S 2 = ^ I d 3 x (E a -E a -B a ■ B a ^j . (559) 
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Since the total energy S is the sum Si + S 2 , its finiteness implies the finiteness of Si, S 2 
and S. Any solution, if exists, is unstable (not a solution) under the scale transformations: 

A a0 (x) -> p\A a0 (\x) , A m (x) -> AA ai (Af) . (560) 

Hence 

£ ai (f) -> p\ 2 E ai (y) , £ oi (£) -> A 4 5 a *(y) , (561) 
where y = Af. Therefore, 

51 = (1/2) y SxE^E^S) - 5x = (l/2)Xp 2 J d 3 yE ai (y)E ai (y) , 

5 2 = i y rf 3 xfi ai (f)5 ai (f) -> S 2 = ^A^ d 3 yB ai (y)B m (y) , 

S = Si-S 2 ^ Xp 2 S 1 - XS 2 , (562) 

which should be stationary at p = A = 1. Thus, the variations of A and p at A = p = 1 
yield Ci — C 2 = and 2£i — C 2 = 0, respectively. Together they give C\ = C 2 = 0, 
hence G^ v = (as expected in the ground state E a = B a = 0). This means that (in 
the temporal gauge) the field A(x) is a pure gauge: A(x) = (—i/g)(VU(x))U~ 1 (x). Now, 
to make progress, we make a very important hypothesis regarding gauge transformations. 
We assume that the allowed gauge transformation matrices U(x) have the same definite 
space-time independent limit at spatial infinity: 

lim U (x) = Uoo = const. (563) 

|x|— +oo 

Physically, this assumption is necessary at least to have a well defined non-Abelian electric 
charge. The total electric charge Q a = g f drf abc A b ■ E c = f drV ■ E a = f dS ■ E a . Under 
a local gauge transformation charge transforms as 

Q a -> / dS-U~ l E a U . (564) 
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If U has a constant limit at spatial infinity, Q a transforms by the global gauge transfor- 
mation U^ 1 Q a U 00 . Otherwise, the transformed charge has no simple relation to the original 
charge and we must conclude that the non-Abelian electric charge is not well defined. Let 
us also note that, because of condition that all pure gauge potentials have the same limit at 
infinity, the possibility of transitions between different the vacuum configurations can not 
be ruled out by kinetic considerations. Such transitions are only possible if the different 
vacuum field configurations do not vary at spatial infinity. Otherwise a transition would 
require an infinite kinetic energy (the integral J d 3 x(doAi) 2 would diverge) and an infinite 
action. 

We can regard U(x) as a continuous mapping of three dimensional space to the 
gauge group G. Equation 563 identifies spatial infinity as one point. This is equivalent to 
compactifying the spatial manifold R 3 to S* 3 . Thus, we only need to consider the maps 
of S 3 — > G. A theorem due to Bott [68] states that any continuous mapping of S 3 into a 
simple Lie group G can be continuously deformed to a mapping into an SU{2) subgroup 
of G. Therefore, for a gauge theory with simple group structure, it is sufficient to consider 
S 3 — > SU(2). The manifold of the £77(2) gauge group is also S 3 . Hence the matrix 
functions U are mappings of S 3 — > S 3 . These can be categorized into homotopy classes 
labelled by an integer, called the "winding number" (the number of times the spatial S 3 
is covered by the group manifold S 3 ) of the mapping [69]. Class n = represents the set 
of gauge transformations which do not wind the target S* 3 at all, and hence are homotopic 
(continuously deformable) to U = I (the mappings of all points in S 3 to the identity of 
S* 3 ); transformations of class n — 1 are those that wind the target S 3 only once. Remember 
that the most general element of 577(2) is of the form: 

U = exp (±i—x • c?) = v ± iv ■ a (565) 

where a are the Pauli matrices, vo = cos (0/2), v = xsin (0/2). As runs from to 2n, map 
Eq. 565 covers the SU(2) manifold once. Therefore, for the maps of class n — 1, replacing 
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by a continuous monotonic function u;(|x|) with appropriate boundary conditions, we 
can write the gauge transformations of the form 

Ui(x) = exp (±ia ■ xu(\x\)) . (566) 

To wind S* 3 once, as \x\ runs from to oo, should satisfy 

u(oo) = 7T, u(0) = . (567) 

As a representative of U±, choosing 

vfi + y 

we have 

iira l x l n\x\ _ „ 7r|x| 

cMx) = exp (± — :) = cos ( — ± la • xsm ( — -.) , (569) 

W V v/|f| 2 + A 2; Vl^l 2 + A2 vW+A 1 

where A is an arbitrary number. U\ can also be written, by making the deformation 



sin(7r|f|/A/|f| 2 + A 2 ) -> 2A|f|/(|f| 2 + A 2 ), as 



where <r M = (/, =R<?) and the unit vector 

«,= (|^f.-2A^). (571) 

As x ranges over all of 3 dimensional space, n M traces out a sphere S 3 in four dimensional 
spacetime. Thus, indeed, U\ maps a 3-space onto S* 3 ~ SU(2), covering it once. Equation 
563 is satisfied since Uoo — — 1. Map Eq. 569 can not be continuously deformed to identity 
(or to any other n = transformations) without violating Eq. 563. Indeed, Eq. 569 only 
yields a unique limit: 

Uoo = cos (0) ± ia • x sin (0) , (572) 
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as \x\ — > oo independently of the direction of approach if is an integer multiple of ir. 
Gauge transformations U n (x) of class n, can be taken as [Ui(x)] n and A m is calculated by 
402. If n — 0, then U = I and we have the trivial vacuum A m = 0. Ui given in Eq. 570 
yields 

A{x) = --(VU^U^ip!) 
9 

i OA 

= g W+W 2 [T<?(|f|2 " A ^ ± 2S( ° ' f) + 2A(f X * )] ' (573) 
where we used = j^p|^ 2 =F i2A an d the identity cr l (<7 • x) = a t a j x j = x l + 

i e vk x j a k j n q Uan tum considerations we will show that only n = homotopy class 
leaves the wave function of the system gauge invariant whereas the higher classes change 
the wave function by a phase. Therefore, homotopically equivalent transformations are 
also gauge equivalent. U\ transforms any classical vacuum configuration in class n to one 
in class (n+ 1). Hence, any gauge transformation of class (n + m) is a product of a member 
of class n with a member of class m. The existence of gauge transformations which are not 
homotopic to the identitity is what distinguishes the non-Abelian theory from the Abelian 
one. In the latter, all gauge transformations fall in the homotopically trivial n = class. 
Thus the vacuum is physically unique in that case. 

An analytic expression for the winding number of a gauge transformation U (x) can 
be given by introducing a second non-conserved topological charge [62]: 

Qr(t) = ^j J d 3 xJ (x,t), (574) 

where J M , defined in Eq. 475, satisfying T = (l/2g 2 )d^ = (1/4)G^*G^. Recall that 
first topological charge, defined in Eq. 482, was absolutely gauge invariant winding num- 
ber q[A] (or the Pontryagin index) of the Euclidean field configuration A a (x). The new 
topological number is related to the issue of how to connect different Minkowski vacua via 
tunneling events (instantons in Euclidean space). Physically QrlA^t] is a number which 
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characterizes a field configuration A^(t, x) in three dimensional space at a fixed time t. For 
an arbitrary field, Qr(t) is neither an integer nor has topological meaning. But for pure 
gauges A^(x) = — ^(d^U(x))U^ 1 (x), as shown below, Qr(t) G Z gives the homotopy class 
index (or winding number) of the gauge transformation U(x). Recall Eq. 497 that for a 
pure gauge 

Jo = -^e ijfe IV[(9 i C/)C/- 1 (9 j C/)C/- 1 (9 fe C/)C/- 1 ] . (575) 

Let x° = t be fixed and assume that U(t,x) — > 1 as \x\ — > oo. Using the Euler angles 9 a 
and going through the same procedure we used for q[A] in section A. 2.1, we obtain 

Q T (t) = J ^^(^i^a)^^)^.^) • ^[(Sa^^^^f/)^^^^)^ 1 ^ 

y de^dOsi— sin (0 2 )) = n . (576) 



2tt 2 

Note that can be changed by large gauge transformations whereas q is gauge invariant 
(Eq. 483). Integrating the four divergence <9 M J M (x) over space and assuming that J vanishes 
faster than 0{l/r 2 ) as r — > oo one obtains 

<9oQr(t) = <% y d 3 xJ = y d 3 x<9 M J M - (577) 

Integrating the above equation over time we find 

Q T (oo) -Q r (-oo) = q. (578) 

The interpretation of this formula is as follows. Euclidean solution A^(x, x 4 ) interpolates 
between the real time Minkowski potentials A^(t = ±oo,x) in the distant past and future. 
The imaginary time x^ = %T is the interpolating parameter. In general, this is not a 
tunnelling process since A^(t = —oo,x) and A^(t = oo,x) are not separated by a barrier. 
However, when the fields A^{t = ±oo,x) are pure gauges (i.e., vacua) so that the Qr(t = 
±oo) are their respective homotopy class labels, then the A^t = ±oo, x) are inequivalent if 
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Qr(t = — oo) 7^ Qr(t = oo) (i.e., they are separated by a barrier in Minkowski spacetime. 
Hence, solutions with q ^ describe tunneling events between topologically inequivalent 
vacua and later it will be shown that in Euclidean space the instanton in the A\ = gauge 
has exactly this behavior). To see this, observe that the field configuration of pure gauge 
A in Eq. 573 has zero potential energy (the term in the energy functional not containing 
derivatives of the field Ai with respect to time): 

j J dx 3 G^G mj = , (579) 

and that there is no zero energy evolution of the system which adiabatically connects 

— * 

configuration Eq. 573, where Qr(t = — oo) = 1, with the configuration A = 0, where 
Qr(t = oo) = 0. To examplify why all the paths joining the two field configurations in real 
time must go over an energy barrier, consider the class of field configurations [63] 

A^{x)= 1 Af\x), (580) 

where 7 G [0, 1] is a real time independent parameter and A^\x) is the pure gauge classical 
vacuum in the n = 1 class given in Eq. 573. We obtain the pure gauges A-i = and 
Ai = A} for 7 = and 7 = 1, respectively. They both give Gij = and hence zero energy 
for 7 = 0,1. For 7 G (0, 1), however, A\ 7 \x) is not a pure gauge. Although the electric 
field G 0t still vanishes because A^ = and A^ is time independent, the magnetic field 
Bi = {l/2)e ijk G jk does not 

Gjk = 7(^4 X) " d k Af) - ig^[Af\ A«] = -^( 7 2 - l)[Af\ a£>] ^ , (581) 

for 7 G (0, 1) . The energy, (-1/8) / d z xTi(GijG ij ) > for < 7 < 1, and proportional to 
(fi ,2 /8)(7 2 — l) 2 (note that J d 3 xTr(GijG tj ) is finite because Af^ falls as \x\~ 2 as \x\ — > 00 
as shown in Eq. 573). As 7 varies in the interval [0, 1], in field space A^ plots a path 
which connects the two classical vacua from n = and n — 1 sectors. The energy of the 
configuration can be interpreted as an energy barrier between the two vacua. However, (due 
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to the nonlinear nature of the Yang-Mills theory) does not solve the field equations, 
hence the path is classically forbidden; the barrier is impenetrable. In the quantum theory, 
tunneling will occur across this barrier. A semiclassical description of tunneling can be 
given by solving the classical equations of motion with imaginary time, thus achieving 
an evolution which is classically forbidden. The solutions with q ^ describe tunnelling 
between topologically inequivalent vacua. As we will show next, the instanton in the 
temporal gauge (A^ st ~ tcmp , A 4 = 0), has exactly this behavior. For x 4 = it — > — oo, 
^mst-temp _^ g)(diU (af))?7 _1 (a 7 ), whereas for X4 = it — > 00, ^ nst_tem P _> 0. The exact 
solution of the imaginary time equations of motion interpolates between the n = 1 and 
n = vacuum field configuration. Hence the instanton solutions with q 7^ describe 
tunnelling between topologically inequivalent vacua. This is another way of saying that we 
are going under an energy barrier. To be more practical [61], we will now use the explicit 
instanton solution given in Eq. 522: 



imst _ 1 x <r) ± x 4 a inst 3-x 



and demonstrate the relation between vacuum tunneling and instantons. To bring the 
instanton field to the temporal gauge we make a gauge transformation 

K 1St ( x ) "> 4 nst - temp (x) = [/(xj^^rHx) - l -{d,U{x))U-\x) , (583) 
such that 4 nst - tcmp (:r) = 0. This yields 

dJJ{x) = -igU(x)AT t - te ^ = ±iU{x) .. 2 °'* x2 , (584) 

\x\ z + x\ + \ l 

which can be integrated to give 

f (7 ' X \ 

U(x,X4) = exp I ±i f(x,X4) , (585) 



\x\ 



with 



\x\ 



fix, X4) = - , 

JK ' v/|f| 2 + A 2 



6*o + arctan 
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X4 



(586) 



By setting the integration constant 9 = | we find that 

lim /(£,ar 4 ) = 0, (587) 

X4^ — OO 

lim /(£,x 4 ) = = ^S~T2 • ( 588 ) 

Hence 

lim [/(£ , x A ) = 1 , (589) 

— * — * 

"77" /T . 

lim U(x , x 4 ) = C/i(f) = exp (±i ) , (590) 

where £/i(x) is given in Eq. 569. Since ^ nst - tcm P — the instanton field in the temporal 
gauge is 

^nst-temp = ^ j (xXa)±Xa _ , ^ ^ 

<? or + :r 4 + A 2 9 

The first term of the above equation vanishes as |:r 4 | — > 00. As x 4 — >■ =Foo, the second term 
yields the following limits: 

lim A mst ~ tcmp = , (592) 

X4— > — OO 

lim A inst " tcmp (f) = --(VC/i(£))C/f : (£) (593) 

which is given explicitly in Eq. 573. Thus, the instanton field in the limit X4 — > —00, is the 
trivial vacuum, whereas in the limit x 4 — > 00, it has the form of the pure gauge obtained 
from the large gauge transformations of class n — 1. This means that the instanton 
interpolates, in Euclidean time, between two field configurations differing by a large gauge 
transformation. 

A. 4 The Vacuum Angle 

Let us now study how quantum theory of the Yang Mills theory responds to gauge 
transformations [61, 66]. We work in the temporal gauge Aq(x) = 0. Recall that in this 
gauge the Hamiltonian is 

H= l -fSx (E?(x)E?(x) + B?(x)B?(x)) , (594) 
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where E a (x) = -dA a i (x)/dt and B a (x) = VA a (x) - y ahc A h x A c . The canonical equal 
time commutators are 

[A ta (x , t), A^ix 1 , t)] = [E ta (x, t), E jh {x', t)] = , (595) 

[E ia {x, t), A^ix 1 , t)] = i5 ij 5 ab 5{x - x 1 ) . (596) 

In the functional Schrodinger representation of the quantum theory [70], the operator Af(x) 
is diagonal. Let \a) be an eigenstate of A?(x) with eigenvalue (eigenfunction) a^(x): 

A«(x)\a)=a«(x)\a). (597) 

Af is an operator, is a function. In this representation, a functional differential repre- 
sentation of the conjugate momentum 

E -"^ = 'uk)- (598) 

is also a differential representation of the above commutators. The state is represented 
by the wave functional \P[a] — (a\^f). is an eigenstate of the above Hamiltonian, if the 
wave functional \l/[a] is a solution of the time independent functional Shrodinger equation: 

\ I ** {-WW**® + *w = ■ < M9 > 

where b a (x) = V x a a (x) — ^f abc a b (x)a c (x) is the eigenvalue of the diagonal and time 
independent magnetic field operator. Recall also that in the temporal gauge there is no 
Hamiltonian equation of motion corresponding to Gauss' law, D • E a = 0. Therefore, we 
must add this equation as a constraint on the states, D ■ E a \^) =0. The wave functional 
\&[a] that satisfy Eq. 599, must also satisfy the functional Gauss' law constraint 

D--^—^[a]= 0. (600) 
oa(x) 
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To interpret this constrained let's consider what happens to ^/[a] under a gauge transfor- 
mation. When we gauge transform a a , we cause a change in the functional ^[a\. Any 
change in the wave functional due to a change in a a is given by 

When the change in a a (x) is due to a gauge transformation, then (using Eq. 602) 

5a°(x) = -- g (d t u a (x) - gf abc cv b (x)a^x)) . (602) 
Therefore, a change in \P[a] due to 5a a (x) of Eq. 602, is 

5m= t ~y d 3 x [E t a (x)^[a]} (d^ix) - gf abc uj b (xK(x)) } , (603) 
where we used Sa 5 a ^ = —iEi a (x). Equation 603 can also be written as 

5^[a] = - I d 3 x fa [u a (x)Ei a (x)] - u a (x)diE a (x) - ^/ a6c cu b (f)<(f)^ a (f)} *[a] , (604) 
which gives 

5%a] = l -(^J dS- E a (x)oj a (x) - J dx 3 u a (x)(D ■ E) a ^j *[a] . (605) 

Notice that the Gauss' law operator (D ■ E) a acting on the wave functional \I/[a] has 
appeared in Eq. 605. If we assume 

Jim uj a (x) = 0, (606) 

the surface term vanishes, and if the wave functional satisfies the Gauss' law constraint, 
then the right hand side of Eq. 605 vanishes completely, and hence the wave functional 
is "invariant under a gauge transformation," 5^[a] = 0. On the other hand, if we want 
the wave functional to be gauge invariant, then the functional must satisfy the Gauss' law 
constraint and the gauge transformation must be of class n = which satisfies Eq. 606. 
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Remember that the transformations which satisfy Eq. 606 are only class n = gauge 
transformations since: 

Jim uj{x) = nit . (607) 

|x|— ►(» 

Thus, the Gauss' law constraint implies the physical wave functional to be "class n = 
gauge invariant." Thus, (D-E) a is the generator of class n = gauge transformations under 
which ty[a\ is invariant. Therefore, we will call the wave functionals obtained by gauge 
transformations satisfying Eq. 606 "gauge equivalent." Let us also note that, when we say 
"gauge invariant," we also mean invariant under time independent gauge transformations 
(i.e., uj a (x) is independent of time. We noted earlier that the temporal gauge condition 
does not fully fix the gauge. Any transformation that is time independent leaves A% — 0). 
Calling Q n the unitary operator which implements the gauge transformation in the nth 
homotopy class (operator representation of U n (x) = [Ui(x)] n ), we have £o^[o] = ^[a]. 
Next we consider the action of Q\ which takes the nth sector of vacua to the (n + l)th 
sector (hence Q n = (Gi) n )- For this class u — > ir as \x\ — > oo, hence gauge invariance of 
\I/[a] can not be forced by Gauss' law (Eq. 605). However, since Q\ commutes with all 
observables (they are gauge invariant), the only effect it can make on physical states is to 
change them by a phase: 

g^[a] = e- ie ty[a] and hence Q n ^[a\ = e' ine ^[a] , (608) 

where 6 is a free, real parameter. In other words, since under a gauge transformation 
of class n 7^ 0, ty[a] is not necessarily invariant, but since the Hamiltonian is locally 
gauge invariant, [H, (D ■ E) a ] = 0, all energy eigenfunctions can be chosen so that they 
change at most by a constant phase, which is the same for all eigenfunctions: ^[a n ] = 
G n ty[a\ = e _m0 \I/[a], where a„ is the transform of a by a class n gauge transformation 
a n = ([/ 1 )™a(£A)- n - ^(t/i) n V([/i)-"; (n = 0, ±1, ±2, . . .) This is the origin of the famous 
angle in Yang-Mills theory. We will denote the vacuum state wave functional by \&e[a] and 
the vacuum state characterized by 6 is called the "6-vacuum." 
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Now, we can ask what the physical vacuum looks like. We have seen that states in 
different homotopy sectors are not forced to be gauge equivalent by Gauss' law. We have an 
infinite number of distinct topological vacuum \n) in each sector. Non of these is the true 
vacuum, since they will tunnel into one another. This is expected since, as we have seen, 
the Yang-Mills theory has finite action instanton solutions. Therefore, assuming that the 
tunneling does occur, one can anticipate that the correct vacua will be linear superposition 
of the topological vacua and are given by |0) = J2°^oo e~ me \n). In the next paragraph, we 
make this claim more rigourously, and show that it is plausible. 

In Hilbert space, we represent the vacuum associated with the homotopy class index 
n by the state vector \n), with (n\m) = 5 nm . The topological charge operator Q T = 
(1/167T 2 ) J d 3 xJo has the property Qr\n) = n\n). Large gauge transformations change n 
so that: 

Q x \n) = \n + l), hence \n) = Q n \n = 0) . (609) 

Therefore [Qt,Gi] = Gi- The state vector \n) is not invariant under G n - However, the 
physical vacuum should be invariant, up to a phase 6, under gauge transformations (i.e., 
since [Gi,H] = 0, their eigenstates are common. Then the unitarity of Gi implies that the 
eigenstates |0) of H must satisfy = e _i6 |0)). Thus, we superimpose the n- vacua to 

form the so called 9 vacuum: 

oo 

|e> = eine \ n ) ■ ( 61 °) 

n=— oo 

Indeed, 

oo oo 

&|e)= e ine \n + l) = e- ie e i(n+1)0 |n + 1) = e^ |6) , (611) 

n=— oo n=— oo 

justifying Eq. 608. Different 6-vacua are orthogonal: 

(9'|e> = ^e^ e - me ')(m|n) = ^ e Me-e') = 2nS ^ Q _ Q ^ (612) 

n,m n 
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Thus, there exists a continuum of 6-vacua and each one is a suitable ground state for 
the physical gauge theory. No gauge invariant operator B ([B,Qi] = 0), can generate 
transitions between different 0-vacua[63]: 

(e'|£|e> = ^e ine ~ m& {m\B\n) = Y,e i(n+p)e - i{m+p)e ' (m + p\B\n + p) (613) 

m,n m,n 

where p is an arbitrary integer. Bracket (m + p\B\n + p) = (m\(Q\) p B(Qi) p \n) = (m\B\n), 
since [B, = and Q\ is unitary. Therefore, we have 

(e'|S|6) = e ^ e - ') ^e ine - ime ' {m\B\n) = e ip ^- @,) (Q'\B\Q) , (614) 

m,n 

which implies, for all p e Z, 

(l_ e v(e-e'))(e'|B|e) = 0, 

or (Q'\B\Q) = (Q\B\Q)5(Q — ©'). Hence, each |6) is the vacuum of a separate sector of 
states. 

Here, let us note that we claimed all physical observables commute with the large 
gauge transformation Q\ but we consider the wave functions \n) and \n + 1) which are 
related exactly by Qi as distinct. Although physics is the same (the observables have the 
same value) in \n) and in \n + 1), as we show below, the existence of distinct vacua changes 
the Lagrangian density by 

C e = -g|^ 6 *G a ^G% = |^ 6 E ai B ai . (615) 

The new O-term, Cq, is a total divergence; it does not effect the equations of motion. 
However, it changes the quantum theory nonperturbatively in a dependent manner. 
The relationship between the canonical momentum conjugate to A? and (Eq. 529), 
also changes 

n? = —E ai — B at . (616) 

1 87T 2 
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The angle in the Yang Mills theory, therefore, can not be avoided by postulating the 
(large) gauge invariance of the states. It reappears as an ambiguity in the definition of the 
Lagrangian, and hence in the canonical variables. Another difference introduced by the 
B-term is the violation of time reversal T and parity P symmetries. Under T, E± — > E*, 
while E>i — > — .Bj. Under P, E^ — > —E^ while E>i — > B^. Thus, under either T or P, 
C@ changes sign, while the original Lagrangian density, proportional to EfEf — BfBf, is 
invariant. 

To prove the claim Eq. 615, we recall the Feynman-Kac formula which gives the 
transition amplitude between n-vacua: 



lim (m\e 




(617) 



where H is the Hamiltonian of the system and, 

1 r T /^ r i r T /^ r 

Se = j dr d 3 xF^F^ = - dr d 3 x(E a ■ E a + B a ■ B a ) (618) 

is the Euclidean action between times — T/2 and T/2. Notice that in Eq. 617 we integrate 
over Euclidean gauge field configurations which interpolate between the vacua \n) and \m). 
Thus, the functional integral in Eq. 617 is restricted by the boundary conditions on the 
winding number <3t[A|: 

Qt[A(x, t = -T/2)] = n and Q T [A(x, r = T/2)] = m , (619) 

with q = m — n. In other words, in Feynman-Kac formula, for \T\ — > oo, we consider 
configurations which yield finite contributions. Since their Euclidean action is finite, 
must vanish at infinity in all Euclidean directions. This means that is a pure gauge 
field and yields a mapping from S" 3 to the group classified by the winding numbers. Then, 
we consider the transition amplitude A between 6-vacua: 

A = lim {&'\e- TH \e} = Ye i{ne ~ m&) lim (m\ e - TH \n} , (620) 

n,m 
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replacing the summation index m by p = (m — n), we have 

A = J2 e l(n0 - p0 '- ne,) lim (p + n\e~ TH \n) . (621) 

n,p 

The amplitude (p + n\e~ TH \n) = (p\e~ TH \0) since [H, G] = 0. Therefore, 

a = j2J2 ein(6 ' @ ' )e ~ tpe ^ m ^ e ~ TH ^ 

p n 

= 5(0 - 0') e ~ ipB [ \ dA \me- SE , (622) 

where the 5-function is obtained by summing over n, and Eq. 617 is used. Now we use the 
formula for topological charge q = (g 2 /32n 2 ) f d A xG a ^ u *G a ^ u) which is an integer, to replace 
p with q. Hence 

A = 5(0 - &) £ J [dA] 9 exp (-S E - ej d^xG^G^ 

= 5(0 - 0') J e~ s ^ , (623) 

where the integration is over all fields, and 

S% = J d 4 x E £* s = J d 4 x E (jG%G% + ^QG^G^j (624) 

d 4 x E Q(W + BfBf) + ^Q(-AEfBf)j . (625) 

To obtain the action in the Minkowski spacetime, we recall the necessary transformations: 
d A XE — ► id^XM, Ef — > —iEf. Then, the effective action transforms as 

S% - Jid 4 x(^(B?B?-E?E?) + ^e(±iE:B?)^ , 

-> i | d 4 x Qg^G^ + J^0G^/G a ^ , (626) 

where we dropped the index M and used Eqs. 454, 456, and 474. Since S E — > — i f d A x M C>% 
(Eq. 461), we find the effective action as 

£ m = _ _|!_ 0G ^*G^ . (627) 

158 



This proves claim Eq. 615. Since the O-angle is an arbitrary real parameter, whose absolute 
value can only be measured, the 9-term in Minkowski spacetime can also be chosen with 
a positive sign. 
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